An experimental analysis on the similarity of argumentation semantics

Cerutti, Federico; Thimm, Matthias; Vallati, Mauro

doi:10.3233/AAC-200907

An experimental analysis on the similarity of argumentation semantics

Article type: Research Article

Authors: Cerutti, Federico^a | Thimm, Matthias^{b; *} | Vallati, Mauro^c

Affiliations: [a] Department of Information Engineering, University of Brescia, Brescia, Italy. E-mail: [email protected] | [b] Institute for Web Science and Technologies, University of Koblenz-Landau, Germany. E-mail: [email protected] | [c] School of Computing and Engineering, University of Huddersfield, Huddersfield, United Kingdom. E-mail: [email protected]

Correspondence: [*] Corresponding author. E-mail: [email protected].

Keywords: Approximate algorithm, experimental analysis, Jaccard’s distance

DOI: 10.3233/AAC-200907

Journal: Argument & Computation, vol. 11, no. 3, pp. 269-304, 2020

Published: 10 November 2020

Get PDF

Abstract

In this paper we ask whether approximation for abstract argumentation is useful in practice, and in particular whether reasoning with grounded semantics – which has polynomial runtime – is already an approximation approach sufficient for several practical purposes. While it is clear from theoretical results that reasoning with grounded semantics is different from, for example, skeptical reasoning with preferred semantics, we investigate how significant this difference is in actual argumentation frameworks. As it turns out, in many graphs models, reasoning with grounded semantics actually approximates reasoning with other semantics almost perfectly. An algorithm for grounded reasoning is thus a conceptually simple approximation algorithm that not only does not need a learning phase – like recent approaches – but also approximates well – in practice – several decision problems associated to other semantics.

1.Introduction

Dung’s theory of abstract argumentation [17] unifies a large variety of formalisms in nonmonotonic reasoning, logic programming and computational argumentation. It is based on the notion of an argumentation framework (AF) that consists of a set of arguments and of an attack relation between them. Different argumentation semantics introduce the criteria to determine which arguments emerge as justified from the conflicts, by identifying a number of extensions, i.e. sets of arguments that can survive the conflicts together. In [17] four traditional semantics are introduced, namely grounded, complete, stable, and preferred semantics. Other literature proposals include semi-stable [5,36] and ideal semantics [18]. For an introduction on the various semantics, see [2]. Several problems are associated to each semantics, notably credulous and skeptical acceptance of an argument with respect to a given argumentation framework – i.e. determining whether an argument belongs to at least one (resp. every) extension – and enumeration of all or some extensions given an argumentation framework. Among those semantics, grounded semantics prescribes a unique extension which can be computed in polynomial time, thus all the problems related to grounded semantics are easy to solve. Instead, decision problems associated to the other semantics are much more complex, with some at the second level of the polynomial hierarchy (see also Section 2).

To our knowledge, only a few work addressed the problem of approximating the solution of some decision or enumeration problems associated to an argumentation semantics, despite its paramount importance for existing argumentation-based decision support systems. For instance, CISpaces [32] is an argumentation-based research-grade prototype for supporting intelligence analysts in their sense-making process, under consideration for transitioning into a commercial product by the U.S. Army Research Laboratory. It makes extensive use of preferred extensions – seen as coherent views of the pieces of information collected by the analyst – hence being in the position to exploit fast, reliable, approximators would clearly be an important feature to improve the user experience.

In [35] predictive models have been positively exploited in abstract argumentation for predicting significant aspects, such as the number, of the solution to the preferred extensions enumeration problem, where the complete knowledge of such structure would require a computationally hard problem to be solved. In [26], an approximation algorithm for credulous reasoning with preferred/complete semantics is presented. That algorithm is based on learning a graph convolutional neural network [25] from a set of correctly solved benchmark instances and then using the learned network as an approximation algorithm. The advantage is that runtime drastically decreases (basically to linear runtime, given the learned network) while classification accuracy is still at a reasonable 80% or more in certain cases, i.e. 80% of all arguments of a certain input argumentation framework were correctly classified as credulously accepted or not. The work [26] thus showed that it is generally feasible to employ this methodology for developing approximation algorithms for hard problems in abstract argumentation. Using more recent approaches from the deep learning community and increasing efforts in streamlining this methodology will probably increase classification accuracy further, see [13,28] for approaches in this direction.

It has to be noted that the methodologies used in works such as [13,26,28] are conceptually complex, requiring sophisticated learning algorithms and complex deep learning models, and need additional time for the learning phase. In the present paper, we ask the question whether such a complexity is necessary in practice. More concretely, we ask the question whether reasoning with grounded semantics, which has polynomial runtime, is not already a sufficient approximation approach. While it is clear from theoretical results (Section 2) that reasoning with grounded semantics is different from, for example, skeptical reasoning with preferred semantics, we wish to investigate how significant this difference is in actual argumentation frameworks (Section 3). As it turns out, in many graphs models (Section 4) reasoning with grounded semantics actually approximates reasoning with other semantics almost perfectly (Section 5). An algorithm for grounded reasoning is thus a conceptually simple approximation algorithm that does not need an expensive learning phase but turns out to have high classification accuracy as well. Our results provide even more general insights. We can observe that many semantics coincide with some others on almost all of our benchmarks. For example, credulous reasoning with preferred semantics coincides with credulous reasoning with semi-stable semantics almost perfectly. This allows for reasoning systems tailored for preferred semantics to be used for semi-stable semantics as well in practice, even if the latter problem is computationally more difficult than the former.

2.Background

An argumentation framework [17] consists of a set of arguments1 1 and a binary attack relation between them.

Definition 1.

An argumentation framework (AF) is a pair Γ=⟨A,R⟩ where A is a set of arguments and R⊆A×A. We say that b attacks a iff (b,a)∈R, also denoted as b→a. The set of attackers of an argument a is denoted as a−≜{b:b→a}, the set of arguments attacked by a is denoted as a+≜{b:a→b}. An argument a without attackers, i.e. such that a−=∅, is said initial. We also extend attack notations to sets of arguments, i.e. given E,S⊆A, E→a iff ∃b∈E s.t. b→a; a→E iff ∃b∈E s.t. a→b; E→S iff ∃b∈E, a∈S s.t. b→a; E−≜{b∣∃a∈E,b→a} and E+≜{b∣∃a∈E,a→b}.

Each argumentation framework has an associated directed graph where the vertices are the arguments, and the edges are the attacks.

The basic properties of conflict–freeness, acceptability, and admissibility of a set of arguments are fundamental for the definition of argumentation semantics.

Definition 2.

Given an AF Γ=⟨A,R⟩:

a set S⊆A is a conflict–free set of Γ if ∄a,b∈S s.t. a→b;
an argument a∈A is acceptable in Γ with respect to a set S⊆A if ∀b∈A s.t. b→a, ∃c∈S s.t. c→b;
the function FΓ:2A→2A such that FΓ(S)={a∣a is acceptable in Γ w.r.t. S} is called the characteristic function of Γ;
a set S⊆A is an admissible set of Γ if S is a conflict–free set of Γ and every element of S is acceptable in Γ with respect to S, i.e. S⊆FΓ(S).

An argumentation semantics σ prescribes for any AF Γ a set of extensions, denoted as Eσ(Γ), namely a set of sets of arguments satisfying the conditions dictated by σ. The paper is focused on grounded (denoted as GR), stable (ST), preferred (PR) semantics, introduced in [17]; as well as on semi–stable (SST), originally introduced with the name of admissible argumentation stage extension in [36] and then re-named in [5,6]; and on ideal (ID), originally introduced in [18].2 2

Definition 3.

Given an AF Γ=⟨A,R⟩:

a set S⊆A is the grounded extension of Γ i.e. {S}=EGR(Γ), iff S is the minimal (w.r.t. set inclusion) fixed point of FΓ;
a set S⊆A is a stable extension of Γ, i.e. S∈EST(Γ), iff S is a conflict-free set of Γ and S∪S+=A;
a set S⊆A is a preferred extension of Γ, i.e. S∈EPR(Γ), iff S is a maximal (w.r.t. set inclusion) admissible set of Γ;
a set S⊆A is a semi–stable extension of Γ, i.e. S∈ESST(Γ), iff S is a preferred extension where S∪S+ is maximal (w.r.t. set inclusion) among all preferred extensions;
a set S⊆A is the ideal extension of Γ, i.e. {S}=EID(Γ), iff S is the maximal (w.r.t. set inclusion) admissible set of Γ that is also subset of each preferred extension.

An argument a is credulously (resp. skeptically) accepted with regard to a given semantics σ and a given AF Γ iff a belongs to at least one (resp. each) extension of Γ under σ. Let denote with σΓ-C (resp. σΓ-S) the set of all the credulously (resp. skeptically) accepted arguments of Γ according to σ, i.e., if ∃S∈Eσ(Γ), a∈S, then a∈σΓ-C; and if ∀S∈Eσ(Γ) a∈S, then a∈σΓ-S. Note that in the case no stable extension exists, STΓ-S=A.

With a slight abuse of notation, and begging the reader for forgiveness, we will write GRΓ=GRΓ-C=GRΓ-S and IDΓ=IDΓ-C=IDΓ-S as both grounded and ideal are unique. Also, when it applies to generic Dung’s argumentation framework, or when it is clear from the context, we will also drop the reference to a given AF, hence for instance we will write PR-C to refer to PRΓ-C where Γ is a generic, unspecified Dung’s argumentation framework, or the specific Dung’s argumentation framework we are discussing in a specific portion of text.

In [4] the notion of skepticism has been formally investigated. As the author themselves discuss in their paper, “skepticism is related with making more or less committed evaluations about the justification state of arguments in a given situation: a more skeptical attitude corresponds to less committed (i.e. more cautious) evaluations.” An extension E1 is at least as skeptical as an extension E2 if E1⊆E2, since then E1 supports the acceptance of no more arguments than E2.

Definition 4.

Given two extensions E1 and E2 of an argumentation framework Γ, E1 is at least as skeptical as E2, denoted as E1⪯E2 if and only if E1⊆E2.

This notion suffices in the case of grounded and ideal semantics as they are unique. In [4] the authors introduced also the following two relations between non-empty sets of extensions3 3 based to skeptical and credulous acceptance.

Definition 5.

Given two non-empty sets of extensions E1 and E2 of an argumentation framework Γ, E1⪯∩E2 if and only if ⋂E1∈E1E1⊆⋂E2∈E2E2.

Definition 6.

Given two non-empty sets of extensions E1 and E2 of an argumentation framework Γ, E1⪯∪E2 if and only if ⋃E1∈E1E1⊆⋃E2∈E2E2.

Figure 1 summarises the skeptical relationships that exists between different semantics extensions [4] when at least one stable extension exists.4 4 In it, for instance, we can see that GR⪯∩ID⪯∩PR⪯∩ST≡SST.

Fig. 1.

⪯∩ (left) and ⪯∪ (right) relation for frameworks where the stable extensions exist.

Table 1

Complexity of traditional decision problem on Dung’s abstract argumentation

σ	a∈?σ-C	a∈?σ-S
GR	P-complete	P-complete
ST	NP-complete	coNP-complete
PR	NP-complete	Π2P-complete
SST	Σ2P-complete	Π2P-complete
ID	Θ2P-complete	Θ2P-complete

3.Measuring relative skepticism

If we have a look at the computational complexity of decision problems associated to Dung’s argumentation framework – see [19] for an extensive analysis – we can see (cf. Table 1) that many decision problems cannot be solved in deterministic polynomial time, except for the case of grounded semantics.

Building on top of Fig. 1 – that assumes the existence of at least one stable extension – we can easily derive a ⊆ ordering between sets of credulously and skeptically accepted arguments according to the semantics we consider in this paper.

Proposition 1.

Given an argumentation framework Γ for which EST(Γ)≠∅

GRΓ⊆IDΓ⊆PRΓ-S⊆STΓ-S≡SSTΓ-S⊆STΓ-C≡SSTΓ-C⊆PRΓ-C

Figure 2 illustrates the result of Proposition 1. We now need to be able to quantify the distance between such sets, so to have an indication of how much the grounded extension covers the other sets. In another line of work, Doutre and Mailly [16] already investigated a similar problem from an analytical perspective. More specifically, they developed difference measures between semantics that take aspects such as computational complexity, formal properties, and other features into account. However, we want to investigate the difference between the sets of accepted arguments empirically in order to assess the practical relevance of such results. For this reason we rely on the statistic provided by the Jaccard’s index [24] that quantifies the similarities between sets.5 5 It is defined as the size of the intersection divided by the size of the union of the sample sets.

Fig. 2.

Hasse diagram of the relationship between sets of credulously and skeptically accepted arguments w.r.t. GR, ST, PR, SST, and ID for argumentation frameworks admitting at least one stable extension.

Definition 7

Definition 7(Jaccard’s Index and Distance, derived from [24]).

Given two sets A and B, their Jaccard’s Similarity Coefficient is:

J(A,B)=|A∩B||A∪B|

Their Jaccard’s distance is then:

Jδ(A,B)=1−J(A,B)

Therefore, the set of all the sets of credulously and skeptically accepted arguments for an AF form a metric space, independently of the existence of stable extensions.

Proposition 2.

Given an AF Γ, ⟨{GRΓ,STΓ-C,STΓ-S,PRΓ-C,PRΓ-S,SSTΓ-C,SSTΓ-S,IDΓ},Jδ⟩ is a metric space.

Proof.

It follows from results in [27]. □

From Figs 1 and 2 one can say that grounded is the most skeptical semantics possible (among those considered). We can then define the measure of relative (to GR) skepticism of a set of credulously or skeptically accepted arguments according to a semantics as the Jaccard’s distance from the grounded extension.6 6

Definition 8

Definition 8(Measure of relative skepticism).

Given an AF Γ and given σ˜Γ∈{GRΓ,STΓ-C,STΓ-S,PRΓ-C,PRΓ-S,SSTΓ-C,SSTΓ-S,IDΓ}, its measure of relative (to GR) skepticism μS is defined as:

μS(σ˜Γ)=Jδ(σ˜Γ,GRΓ)

The following propositions show properties of this measure: proofs are omitted as straightforward. In particular, this measure is a function whose range is the set of real numbers between 0 and 1 (both included).

Proposition 3.

Given an AF Γ and given σ˜Γ∈{GRΓ,STΓ-C,STΓ-S,PRΓ-C,PRΓ-S,SSTΓ-C,SSTΓ-S,IDΓ}, μS(σ˜Γ)=[0,1].

Also, the measure of relative skepticism has a global minimum point in correspondence of the grounded semantics.

Proposition 4.

Given an AF Γ, μS(GR)⩽μS(σ˜Γ), ∀σ˜Γ∈{GRΓ,STΓ-C,STΓ-S,PRΓ-C,PRΓ-S,SSTΓ-C,SSTΓ-S,IDΓ}.

However, such a global minimum point, in general, is not unique as the grounded extension might coincide with some other set of credulously or skeptically accepted arguments σ˜Γ.

Proposition 5.

Given an AF Γ, ∀σ˜Γ∈{GRΓ,STΓ-C,STΓ-S,PRΓ-C,PRΓ-S,SSTΓ-C,SSTΓ-S,IDΓ}, if μS(σ˜Γ)=0 then σ˜Γ=GRΓ.

Finally, it follows that in the case of acyclic free AFs, each set of credulously or skeptically accepted arguments has zero as measure of relative skepticism, consistently with the results provided in [19].

Proposition 6.

Given an acyclic AF Γ, ∀σ˜Γ∈{GRΓ,STΓ-C,STΓ-S,PRΓ-C,PRΓ-S,SSTΓ-C,SSTΓ-S,IDΓ} μS(σ˜Γ)=0.

4.Benchmarks

To experimentally analyse the measure of relative skepticism (Definition 8), we considered a significantly large experimental setting with a great variety of benchmarks, so to cover the vast majority of benchmarks currently used in abstract argumentation.7 7

4.1.A million AFs

We exhaustively generated the first million of AFs with increasing size using the EnumeratingDungTheoryGenerator from TweetyProject.8 8 This generator enumerates all possible combinations of attacks with an increasing number of arguments. In the following, we collectively refer to this group of AFs as aMillion. Figure 3 illustrates the first six of them. Note that the AFs in this group are rather small (the largest contains only five arguments), but there is no underlying model in the generation process, making this group unbiased on any assumption about the generation process.

Fig. 3.

The first six argumentation frameworks with increment size generated using the Tweety Generator.

4.2.Structured AFs

Fig. 4.

One of the smallest examples of ABA-derived Dung’s AFs in the ICCMA 2017 benchmarks.

We also considered the 426 Assumption-Based argumentation frameworks translated to Dung’s argumentation framework that have been submitted to the ICCMA 2017.9 9 This benchmark restricted the ABA benchmark provided in [14]10 10 to those with at most 1,500 arguments. In the following, we collectively refer to this group of AFs as sABA. Figure 4 depicts one of the smallest.

Fig. 5.

Example of a Dung’s AF generated from an ASPIC-like theory.

We considered 300 ASPIC-like instances generated using TweetyProject.11 11 In the following, we collectively refer to this group of AFs as sASPIC. Listing (1) in Appendix A lists an example of ASPIC-like theories used for generating the Dung’s AF depicted in Fig. 5; Appendix A also contains the necessary background on ASPIC [30] and the generation process.

4.3.Random AFs with palatable argumentative characteristics

Fig. 6.

One of the smallest example of Dung’s AFs generated controlling parameters related to strongly connected components.

We randomly generated, using the probo [9] SccGenerator 2,400 AFs with a controllable number structure in terms of strongly connected components. In a first step, a specified number of arguments are partitioned (with a uniform distribution) into components C1,…,Cn. Within each component attacks are added randomly with a high probability given as a parameter (and thus likely forming a strongly connected component). In-between components attacks are randomly added with less probability (also given as parameter), but only from a component Ci to Cj with i>j (in order to avoid having few large strongly connected components).12 12 In the following, we collectively refer to this group of AFs as rSCC. Figure 6 depicts one of the smallest.

We also randomly generated 200 AFs featuring a large number of stable extensions – and clearly of preferred extensions as well – using probo’s StableGenerator. This generator first identifies a set of arguments grounded (of size uniformly distributed between a given interval) to form an acyclic subgraph which will contain the grounded extension. Afterwards another subset M (a candidate for a stable extension) of arguments (of size uniformly distributed between a given interval) is randomly selected and attacks are randomly added from some arguments within M to all arguments neither in M nor grounded. This process is repeated until a number of desired stable extensions is reached.13 13 In the following, we collectively refer to this group of AFs as rStable. Figure 7 depicts an example of such AFs.

Fig. 7.

An example of AFs featuring a large number of stable extensions.

4.4.Random graphs as AFs

Finally, we consider random graphs generators proposed in literature as a way to generate random AFs as well.

We generated 599 AFs according to the Erdös-Rényi [20] model – with edges between arguments randomly selected according to a uniform distribution – varying the number of arguments between 20 and 200, with an increment of 20, and with probability of attacks fixed as {0.01,0.05,0.1}.14 14 In the following, we collectively refer to this group of AFs as rER. Figure 8 depicts an example of Erdös-Rényi-like AFs.

Fig. 8.

An example of Erdös–Rényi-like AFs.

We also generated 360 AFs according to the Barabasi–Albert [1] model, varying the number of arguments between 20 and 200 with an increment of 20; and enforcing the probability to have at least one argument belonging to a cycle in the range {0.1,0.2,0.3}.15 15 The Barabasi–Albert model enforces the common property of many large networks that the node connectivities follow a scale-free power-law distribution. This is generally the case when: (i) networks expand continuously by the addition of new nodes, and (ii) new nodes attach preferentially to sites that are already well connected. In the following, we collectively refer to this group of AFs as rBA. Figure 9 depicts an example Barabasi–Albert-like AFs.

Fig. 9.

An example of Barabasi–Albert-like AFs.

Finally, we considered the Watts–Strogatz [37] model, where a ring of n arguments where each argument is connected to its k nearest neighbors in the ring. k must satisfy n>k>log(n)>1 to ensure a connected graph. Also, each edge is randomly rewired with a probability β. Indeed, Watts and Strogatz [37] show that many biological, technological and social networks are neither completely regular nor completely random, but something in the between. They thus explore simple models of networks that can be tuned through this middle ground: regular networks rewired to introduce increasing amounts of disorder. These systems can be highly clustered, like regular lattices, yet have small characteristic path lengths, like random graphs, and they are named small-world networks by analogy with the small-world phenomenon. We generated 1,800 AFs according to the Watts–Strogatz model varying the number of arguments between 20 and 200 with an increment of 20; enforcing the probability to have at least one argument belonging to a cycle in the range {0.1,0.2,0.3}; setting k equal to half of the number of arguments; and varying β in {0.2,0.4,0.6}.16 16 In the following, we collectively refer to this group of AFs as rWS. Figure 10 depicts an example of Watts–Strogatz-like AFs.

Fig. 10.

An example of Watts–Strogatz-like AFs.

5.Empirical analysis

To inform useful considerations, let us introduce the averaged measure of relative skepticism over a set B of AFs.

Definition 9.

Let B be a set of AFs, given a semantic σ∈{GR,ST,PR,SST,ID}, μSB is the averaged measure of relative skepticism w.r.t. B defined as follows:

μSB(σ-C)=∑Γ∈BμS(σΓ-C)|B|

and

μSB(σ-S)=∑Γ∈BμS(σΓ-S)|B|

5.1.Empirical evaluation of the averaged measure of relative skepticism

Figure 11 graphically summarises the averages of the measure of relative skepticism μS over the benchmarks we introduced in Section 4: full results for each of the benchmark groups are provided in Appendix B. First of all, it is worth mentioning that – except for the case of ST-S – the ⊆ ordering discussed in Proposition 1 is naturally maintained. Also, the measure of relative skepticism is bounded between 0 and 1 (cf. Proposition 3): since μS(GRΓ)=0 for any AF Γ (cf. Proposition 4), we omit it from Fig. 11.

Fig. 11.

Measure of relative skepticism μS of sets of credulously or skeptically accepted arguments according to the semantics discussed in Definition 3 and over the benchmarks described in Section 4. GR is omitted as μS(GRΓ)=0 for any AF Γ.

Fig. 12.

0.05-clusters of averaged measures of relative skepticism μS of sets of credulously or skeptically accepted arguments according to the semantics discussed in Definition 3 and over the benchmarks described in Section 4.

As we can see in Fig. 11, there are case where the averaged measures of relative skepticism appear to cluster closely together. To investigate this further, let us introduce the concept of a ε-cluster, as the set of credulously or skeptically accepted arguments with averaged measure of skepticism all within a chosen ε.

Definition 10.

Let B be a set of AFs, and σ˜1B,σ˜2B∈{GR,ST-C,ST-S,PR-C,PR-S,SST-C,SST-S,ID}, we say that σ˜1B and σ˜2B belong to the same ε-cluster Cε with ε⩾0 if |μSB(σ˜1B)−μSB(σ˜2B)|⩽ε.

Figure 12 provides a qualitative interpretation of Fig. 11 in terms of 0.05-clusters of sets of credulously or skeptically accepted arguments.

From Figs 11 and 12, we can observe the following:

(1) GR on average coincides with the skeptically accepted arguments of any semantics for sASPIC, and rBA. In the same sets it appears that all the credulously accepted set of arguments have the same averaged measure of relative skepticism;
(2) GR on average is a good estimator17 17 of PR-S, SST-S, and ID for sABA, as they all belong to the same 0.05-cluster;
(3) GR might serve as an estimator of all the sets of credulously and skeptically accepted arguments w.r.t. any semantics except for ST-S for rWS: despite not being in the same 0.05-cluster, they would be part of the same 0.1-cluster;
(4) ST-S is furthest apart from GR when considering rSCC and rWS;
(5) PR-C seems consistently to have the same measure of skepticism of SST-C, except in rER;
(6) GR on average does not seem to be a reasonable choice to predict any set of credulously or skeptically arguments for any semantics in the case of rSCC, rStable, and rER.

Figure 13 depicts the distribution of the averages of Jaccard’s distances across all the dataset comparing all the combinations of sets of credulously or skeptically accepted arguments. We can thus see:

(1) PR-S almost always coincide with ID and SST-S in our dataset;
(2) PR-C almost always coincide with SST-C in our dataset.

Fig. 13.

Distributions of Jaccard’s distance – aggregating the averages for each dataset – between the various sets of credulously or skeptically accepted arguments for the various semantics identified in Definition 3 and: GR (a) (equivalent to μS); ID (b); ST-C (c); ST-S (d); PR-C (e); PR-S (f); SST-C (g); SST-S (h).

5.2.Empirical analysis of features impacting the measure of relative skepticism

We then question whether there are some specific characteristics that substantially impact the measure of relative skepticism. To this end, following traditional machine learning approaches, we note that information about the structure of an AF can be extracted under the form of features. Each feature summarises a potentially important property of the considered framework, and the whole set of features can be seen as the fingerprint of the AF at hand.

Consistently with other approaches exploiting predictive models [7,10,11,33–35], we decided a large set of features from each of the AFs. In total, the feature set includes 147 values, which exploit the representation of AFs as a graph or as a matrix.18 18 Fig. 14 provides an example AFand its corresponding matrix encoding, from which additional features can be decided. Examples of the considered features include the number of arguments, number of Strongly Connected Components, presence of auto-loops, etc.

Fig. 14.

An example AF, and its matrix encoding on the right.

In order to identify a subset of relevant features, out of the 147 considered, we implemented a three-step approach based on linear regression and feature selection techniques. Linear regression aims at modeling the relationships between the considered features and the value to predict by fitting a linear equation to the available data [21]. Feature selection has been performed using CorrelationAttribute in Weka [22]. The Correlation Attribute technique evaluates the worth of a feature by measuring the correlation (Pearson’s) between it and the value to predict.

The implemented three-step approach consisted of: (i) performing linear regression on the whole set of features; (ii) feature selection, and (iii) linear regression again on the subset of selected features. The idea being of generating a predictive model based on all the possible features in the first step, as a reference model. A subset of features deemed to be informative is then selected (ii), and its usefulness is assessed by generating a second predictive model, that exploits only the subset of features selected, and compare its performance against the model generated at step (i). If the models generated at steps (i) and (iii) show similar performance, we can conclude that the selected subset of features include the informative ones. Further, the use of linear regression can also help in gaining an understanding of the relevance of features, on the basis of the assigned weight.

From the performed analysis, it appears that the most important aspects are: (1) flow hierarchy;19 19 (2) the aperiodicity of the graph;20 20 and (3) the number of SCCs. There are also some informative matrix-related features, but those are much harder to relate to characteristics of the AFs. Appendix C provides additional details broken down per benchmark set.

6.Conclusion

In this paper we provide a first answer to the question whether reasoning with grounded semantics, which has polynomial runtime, is not already a sufficient approximation approach. As it turns out, in many graphs models reasoning with grounded semantics actually approximates reasoning with other semantics almost perfectly. Indeed, our extensive experimental analysis (Section 5) shows that the grounded extension is a very good – sometimes a perfect – estimator of skeptical acceptance of arguments in AFs derived from the two approaches to structured argumentation we considered in this paper, as well as from graphs obeying to the Barabasi–Albert and Watts–Strogatz models, according to all the semantics except for stable. The main reason for this is that stable semantics is the only semantics for which existence is not guaranteed, and this has clearly a substantial effect on skeptical acceptance. Instead, it appears to be not a good predictor in the case of argumentation frameworks with a substantial number of SSCs, with a large number of stable extensions, and derived from graphs obeying to the Erdös-Renyi model. The AFs derived from graphs obeying to the Erdös-Renyi model also seem to be the ones for which the set of arguments credulously accepted according to preferred semantics is different from the set of arguments credulously accepted according to semi-stable semantics, albeit they both belong to the same 0.05-cluster. We also performed an analysis of graph features that could be mostly informative for predicting the correlation results. Unsurprisingly it appears that the most informative ones are connected to the presence of cycles in the graph.

The extensive experimental section of this paper supports the claim that an algorithm for grounded reasoning is thus a conceptually simple approximation algorithm that does not need an expensive learning phase while having a good performance on several instances of AFs, including some of those closely linked to structured argumentation. On the one hand, this can thus help the development of real-world tools using abstract argumentation, where results, albeit approximate, might need to be provided in a near-real-time setting. On the other hand, it helps shed some light about the benchmarks currently used in the community, and provides effective guidance on the hardness of AFs instances. Indeed, one could argue that benchmarks for which the 0.05-clusters of averaged measures of relative skepticism (i.e. Fig. 12) should resemble as much as possible the theoretical results associated to skeptical relationships between semantics, i.e. Fig. 2. From visual inspection, the set of AFs exhibiting a large number of stable extensions, as well as those derived from graphs obeying to the Erdös-Renyi model appear to have the 0.05-clusters distributed in a way similar to the theoretical results we know from the analysis of skepticisms of the various semantics.

Notes

1 In this paper we consider only finite sets of arguments: see [3] for a discussion on infinite sets of arguments.

2 Note that we do not consider complete semantics [17] explicitly, as skeptical reasoning with complete semantics is identical to reasoning with grounded semantics and credulous reasoning with complete semantics is identical to credulous reasoning with preferred semantics, see below.

3 An interested reader is referred to [4] to appreciate the differences with possibly empty sets of extensions.

4 As discussed at length in [4], without such an assumption, no conclusion can be drawn regarding the skeptical relationships between stable and other semantics.

5 Jaccard’s distance is but one of many options for measuring the dissimilarities between finite sets [15]; it appears to us that it is a natural one in virtue of its simplicity and its straightforward connection with the notion of skepticism [4]. Moreover, Jaccard’s distance is also the first non-correlation-based distance proposed in literature [12] and it has been proven to be analogous, a special case or connected concepts of much more elaborated ones, such as the Marczewski–Steinhaus distance [29], the Tanimoto distance [31], and the Horadam–Nyblon distance [23].

6 This definition does not require the existence of at least one stable extension: for each AF Γ=⟨A,R⟩ with EST(Γ)=∅, STΓ-C=∅ and STΓ-S=A.

7 An archive of all benchmarks can be downloaded from http://mthimm.de/misc/exparg_instances.tar.gz.

8 http://tweetyproject.org/api/1.17/net/sf/tweety/arg/dung/util/EnumeratingDungTheoryGenerator.html

9 http://argumentationcompetition.org/2017/ABA2AF.pdf

10 Details and the full set of benchmarks can be found at http://robertcraven.org/proarg/experiments.html (on 18 Aug 2020).

11 http://tweetyproject.org/api/1.17/net/sf/tweety/arg/aspic/util/RandomAspicArgumentationTheoryGenerator.html

12 Parameters used here: arguments=20,40,…,200, numSccs=#arg/5,#arg/10,#arg/20, innerprob=0.6,0.8, outerprob=0.05,0.1.

13 Parameters used: arguments=20,40,…,200, minNum=#arg/20, maxNum=#arg/2, minSize=#arg/10, maxSize=#arg/2, minGround=0, maxGround=#arg/10.

14 AFBenchGen2 [8] parameters numargs=20,40,…,200, and ER_probAttacks=0.01,0.05,0.1.

15 AFBenchGen2 [8] parameters numargs=20,40,…,200, and BA_WS_probCycles=0.1,0.2,0.3.

16 AFBenchGen2 [8] parameters numargs=20,40,…,200, BA_WS_probCycles=0.1,0.2,0.3, WS_baseDegree=(#arg/2), and WS_beta=0.2,0.4,0.6.

17 By good estimator we consider 0.05-clusters.

18 The interested reader is referred to [35] for a detailed description of the features.

19 The fraction of edges not participating in cycles in a directed graph.

20 A graph is aperiodic if there is no k>1 that is a integer divisor of the length of each cycle in the graph.

21 Note also that we depart from ASPIC+ terminology at times.

22 http://tweetyproject.org/api/1.17/net/sf/tweety/arg/aspic/util/RandomAspicArgumentationTheoryGenerator.html

Appendices

Appendix A.

Appendix A.Background in ASPIC and example of ASPIC-like derived Dung’s AF

In the following, we present a minimal variant of the propositional instantiation of ASPIC+ [30]. Note that ASPIC+ is a general framework that can be instantiated using a variety of different logics and is also able to adhere for the inclusion of orderings between rules, but we only stick to a very simple version.21 21

Let L be a finite set of propositions and let Lˆ be the set of literals of L, i.e., Lˆ={a,¬a∣a∈L}. For a∈L define a‾=¬a and ¬a‾=a. ASPIC+ differentiates rules into strict rules (rules that are always supposed to hold) and defeasible rules (rules that “usually” hold).

Definition 11.

A knowledge base K is a pair K=(Ks,Kd) where

Ks is a set of strict rules of the form ϕ1,…,ϕn→ϕ with ϕ1,…,ϕn,ϕ∈Lˆ.
Kd is a set of defeasible rules of the form ϕ1,…,ϕn⇒ϕ with ϕ1,…,ϕn,ϕ∈Lˆ.

A strict rule ϕ1,…,ϕn→ϕ with n=0 is written as →ϕ and is also called an axiom. A defeasible rule ϕ1,…,ϕn⇒ϕ with n=0 is written as ⇒ϕ and is also called an assumption. For practical reasons we often identify K=(Ks,Kd) with Ks∪Kd, e.g., expressions such as “r∈K” are to be read as “r∈Ks or r∈Kd”.

Arguments can now be constructed by chaining rules. Following [30], for each argument A we denote by Prem(A) the set of axioms and assumptions used to construct A, Conc(A) is the conclusion of A, Sub(A) is the set of sub-arguments of A, DefRules(A) the set of defeasible rules in A, and TopRule(A) is the last rule used in A.

Definition 12.

The set of arguments AK generated by a knowledge base K=(Ks,Kd) is inductively defined as follows:

If ⇒ϕ∈K then (⇒ϕ) is an argument with Prem(⇒ϕ)={⇒ϕ}, Conc(⇒ϕ)=ϕ, Sub(⇒ϕ)={⇒ϕ}, DefRules(⇒ϕ)={⇒ϕ}, TopRule(⇒ϕ)=(⇒ϕ).
If →ϕ∈K then (→ϕ) is an argument with Prem(→ϕ)={→ϕ}, Conc(→ϕ)=ϕ, Sub(→ϕ)={→ϕ}, DefRules(→ϕ)=∅, TopRule(→ϕ)=(→ϕ).
If ϕ1,…,ϕn⇒ψ∈K and A1,…,An are arguments such that ϕ1=Conc(A1),…,ϕn=Conc(An), then A=(A1,…,An⇒ψ) is an argument such that: Prem(A)=Prem(A1)∪⋯∪Prem(An), Conc(A)=ψ, Sub(A)=Sub(A1)∪⋯∪Sub(An)∪{A}, DefRules(A)=DefRules(A1)∪⋯∪DefRules(An)∪{ϕ1,…,ϕ1⇒ψ}, TopRule(A)=ϕ1,…,ϕn⇒ψ.
If ϕ1,…,ϕn→ψ∈K and A1,…,An are arguments such that ϕ1=Conc(A1),…,ϕn=Conc(An), then A=(A1,…,An→ψ) is an argument such that: Prem(A)=Prem(A1)∪⋯∪Prem(An), Conc(A)=ψ, Sub(A)=Sub(A1)∪⋯∪Sub(An)∪{A}, DefRules(A)=DefRules(A1)∪⋯∪DefRules(An), TopRule(A)=ϕ1,…,ϕn→ψ.

An argument A is called strict if DefRules(A)=∅, otherwise it is called defeasible. In our simplified framework, we only consider rebuts [30] as the attack relation between arguments.

Definition 13.

Let A and B be two arguments. We say that A attacks B, denoted as A⇝B, if Conc(A)=a‾ for some B′∈Sub(A) of the form B1″,…,Bn″⇒a.

Using the previous two definitions an abstract argumentation framework can be derived from a knowledge base K as follows.

Definition 14.

The abstract argumentation framework AFK corresponding to a knowledge base K is an argumentation framework AFK=(AK,⇝) where AK is the set of arguments generated by K as defined by Definition 12 and ⇝ is the attack relation on AK as defined by Definition 13.

Our algorithm for generating random ASPIC-like theories22 22 takes as input the number of propositions (n), the number of formulas (m), the maximum number of literals in bodies of rules (l) and the percentage of strict rules (s), and generates m rules, each with at most l body literals (uniformly distributed, zero body literals are also possible, giving rise to axioms and assumptions) and uniformly distributed head literal. Using this generator, we created the set sASPIC of 300 random instances with n=18, m=80, l=2, and s∈{0,0.33,0.66}.

An example of a ASPIC-like theory that is used to derive a Dung AF is as follows:

(1)a1⇒¬a1a4⇒a1a5⇒a2a1→¬a0a1,¬a1⇒a1¬a3,¬a2⇒a1¬a4,¬a3⇒¬a2¬a1,a0⇒a3→a2⇒a3a5,¬a5→¬a5⇒a0⇒¬a1⇒a1→a1⇒¬a2→¬a0¬a2⇒a4¬a0⇒¬a4

Appendix B.

Appendix B.Detailed experimental results

Table 2

Jaccard’s distance – for the aMillion dataset – between the various sets of credulously or skeptically accepted arguments for the various semantics identified in Definition 3

aMillion. Table 2 summarises the average Jaccard’s distance among the various sets of credulously or skeptically accepted arguments for the semantics identified in Definition 3, and Fig. 15 provides a boxplot representation of the distributions.

sABA. Table 3 summarises the average Jaccard’s distance among the various sets of credulously or skeptically accepted arguments for the semantics identified in Definition 3, and Fig. 16 provides a boxplot representation of the distributions.

Table 3

Jaccard’s distance – for the sABA dataset – between the various sets of credulously or skeptically accepted arguments for the various semantics identified in Definition 3

sASPIC. Table 4 summarises the average Jaccard’s distance among the various sets of credulously or skeptically accepted arguments for the semantics identified in Definition 3, and Fig. 17 provides a boxplot representation of the distributions.

Table 4

Jaccard’s distance – for the sASPIC dataset – between the various sets of credulously or skeptically accepted arguments for the various semantics identified in Definition 3

rSCC. Table 5 summarises the average Jaccard’s distance among the various sets of credulously or skeptically accepted arguments for the semantics identified in Definition 3, and Fig. 18 provides a boxplot representation of the distributions.

Fig. 15.

Distributions of Jaccard’s distance – for the aMillion dataset – between the various sets of credulously or skeptically accepted arguments for the various semantics identified in Definition 3 and: GR (a) (equivalent to μS); ID (b); ST-C (c); ST-S (d); PR-C (e); PR-S (f); SST-C (g); SST-S (h).

Fig. 16.

Distributions of Jaccard’s distance – for the sABA dataset – between the various sets of credulously or skeptically accepted arguments for the various semantics identified in Definition 3 and: GR (a) (equivalent to μS); ID (b); ST-C (c); ST-S (d); PR-C (e); PR-S (f); SST-C (g); SST-S (h).

Fig. 17.

Distributions of Jaccard’s distance – for the sASPIC dataset – between the various sets of credulously or skeptically accepted arguments for the various semantics identified in Definition 3 and: GR (a) (equivalent to μS); ID (b); ST-C (c); ST-S (d); PR-C (e); PR-S (f); SST-C (g); SST-S (h).

Table 5

Jaccard’s distance – for the rSCC dataset – between the various sets of credulously or skeptically accepted arguments for the various semantics identified in Definition 3

rStable. Table 6 summarises the average Jaccard’s distance among the various sets of credulously or skeptically accepted arguments for the semantics identified in Definition 3, and Fig. 19 provides a boxplot representation of the distributions.

Table 6

Jaccard’s distance – for the rStable dataset – between the various sets of credulously or skeptically accepted arguments for the various semantics identified in Definition 3

rER. Table 7 summarises the average Jaccard’s distance among the various sets of credulously or skeptically accepted arguments for the semantics identified in Definition 3, and Fig. 20 provides a boxplot representation of the distributions.

Table 7

Jaccard’s distance – for the rER dataset – between the various sets of credulously or skeptically accepted arguments for the various semantics identified in Definition 3

Fig. 18.

Distributions of Jaccard’s distance – for the rSCC dataset – between the various sets of credulously or skeptically accepted arguments for the various semantics identified in Definition 3 and: GR (a) (equivalent to μS); ID (b); ST-C (c); ST-S (d); PR-C (e); PR-S (f); SST-C (g); SST-S (h).

Fig. 19.

Distributions of Jaccard’s distance – for the rStable dataset – between the various sets of credulously or skeptically accepted arguments for the various semantics identified in Definition 3 and: GR (a) (equivalent to μS); ID (b); ST-C (c); ST-S (d); PR-C (e); PR-S (f); SST-C (g); SST-S (h).

rBA. Table 8 summarises the average Jaccard’s distance among the various sets of credulously or skeptically accepted arguments for the semantics identified in Definition 3, and Fig. 21 provides a boxplot representation of the distributions.

Table 8

Jaccard’s distance – for the rBA dataset – between the various sets of credulously or skeptically accepted arguments for the various semantics identified in Definition 3

rWS. Table 9 summarises the average Jaccard’s distance among the various sets of credulously or skeptically accepted arguments for the semantics identified in Definition 3, and Fig. 22 provides a boxplot representation of the distributions.

Table 9

Jaccard’s distance – for the rWS dataset – between the various sets of credulously or skeptically accepted arguments for the various semantics identified in Definition 3

Appendix C.

Appendix C.Detailed results of feature selection for relative measure of skepticism

C.1.aMillion

ST-C: only matrix-related and number of argcs in undirected graph have some relevance.

Fig. 20.
Distributions of Jaccard’s distance – for the rER dataset – between the various sets of credulously or skeptically accepted arguments for the various semantics identified in Definition 3 and: GR (a) (equivalent to μS); ID (b); ST-C (c); ST-S (d); PR-C (e); PR-S (f); SST-C (g); SST-S (h).

Fig. 21.
Distributions of Jaccard’s distance – for the rBA dataset – between the various sets of credulously or skeptically accepted arguments for the various semantics identified in Definition 3 and: GR (a) (equivalent to μS); ID (b); ST-C (c); ST-S (d); PR-C (e); PR-S (f); SST-C (g); SST-S (h).

Fig. 22.
Distributions of Jaccard’s distance – for the rWS dataset – between the various sets of credulously or skeptically accepted arguments for the various semantics identified in Definition 3 and: GR (a) (equivalent to μS); ID (b); ST-C (c); ST-S (d); PR-C (e); PR-S (f); SST-C (g); SST-S (h).
ST-S: set of very informative features. In particular, aperiodicity and flow hierarchy.
PR-C: set of informative features, most of them matrix-related, flow hierarchy, number of scc, aperiodicity. In particular, flow hierarchy seems to be the most important for predictions.
PR-S: quite hard to deal with, but looks like flow hierarchy, number of SCCs, and aperiodicity are somehow useful.
SST-C: set of very informative features. In particular, flow hierarchy and a few from matrix representation.
SST-S: only 3 useful features: flow hierarchy, number of SCCs, and aperiodicity.
ID: as in SST-S.

C.2.sABA

ST-C: not very informative features, but mostly related with matrix representation, ratio edges/arcs, and flow hierarchy.
ST-S: not very informative features, but mostly related with matrix representation, ratio edges/arcs, and flow hierarchy.
PR-C: small set of moderately informative features: flow hierarchy, ratio edges/arcs, ratio edges/arcs on undirected graph (multiple arcs collapsed), aperiodicity, and matrix-related For predictions, flow hierarchy and aperiodicty are the most relevant.
PR-S: only 2 informative features: flow hierarchy and aperiodicity
SST-C: small set of moderately informative features: flow hierarchy, ratio edges/arcs, ratio edges/arcs on undirected graph (multiple arcs collapsed), aperiodicity, and matrix-related For predictions, flow hierarchy and aperiodicty are the most important.
SST-S: only aperiodicity and flow hierarchy look to be quite informative.
ID: extremely hard to predict, we are not able to extract any meaningful piece of information.

C.3.sASPIC

ST-C: small set of very informative features: flow hierarchy, ratio edges/arcs, ratio edges/arcs on undirected graph (multiple arcs collapsed), aperiodicity, and number of SCCs. For predictions, flow hierarchy, and aperiodicty are the most important.
ST-S: extremely hard to predict, we are not able to extract any meaningful piece of information.
PR-C: small set of very informative features: flow hierarchy, ratio edges/arcs, ratio edges/arcs on undirected graph (multiple arcs collapsed), aperiodicity, and number of SCCs. For predictions, flow hierarchy and aperiodicty are the most important.
PR-S: extremely hard to predict, we are not able to extract any meaningful piece of information.
SST-C: small set of very informative features: flow hierarchy, ratio edges/arcs, ratio edges/arcs on undirected graph (multiple arcs collapsed), aperiodicity, and number of SCCs. For predictions, flow hierarchy and aperiodicty are the most important.
SST-S: extremely hard to predict, we are not able to extract any meaningful piece of information.
ID: extremely hard to predict, we are not able to extract any meaningful piece of information.

C.4.rSCC

ST-C: huge set of informative features. Aperiodicity is the most informative, while aperiodicity and matrix variance are the most important for predictions.
ST-S: hard to predict, with just a small set of not-very-informative features. Aperiodicity, ratio edges/arcs, ratio edges/arcs on undirected graph and variance of the matrix diagonal are somewhat used in predictions.
PR-C: set of quite informative features, mostly related to degree and density, together with some aspects of matrix representation. In predictions, those related to matrix representation seem to be very relevant.
PR-S: set of quite informative features, mostly related to degree, density, and transitivity, together with some aspects of matrix representation, in particular: transitivity and matrix variance.
SST-C: large set of quite informative features, mostly related to degree and density, together with some aspects of matrix representation. For predictions, matrix-related average seems to be the most informative feature.
SST-S: set of quite informative features, mostly related to degree, density, and transitivity, together with some aspects of matrix representation, in particular: transitivity and matrix variance.
ID: set of quite informative features, mostly related to degree, density, and transitivity, together to some aspects of matrix representation, in particular, transitivity and matrix variance.

C.5.rStable

ST-C: small set of very informative features: flow hierarchy is the most important (also in prediction), and then ratio edges/arcs, and ratio edges/arcs on undirected graph.
ST-S: large set of moderately informative features. For predictions, flow hiearchy is the most important.
PR-C: small set of very informative features: flow hierarchy, ratio edges/arcs, ratio edges/arcs on undirected graph (multiple arcs collapsed), aperiodicity, matrix-related, and number of SCCs. For predictions, flow hierarchy and aperiodicty are the most important.
PR-S: very hard to deal with. It looks that only flow hiearchy does provide some sort of information, albeit very limited.
SST-C: small set of very informative features: flow hierarchy, ratio edges/arcs, ratio edges/arcs on undirected graph (multiple arcs collapsed), aperiodicity, matrix-related, and number of SCCs. For predictions, flow hierarchy and aperiodicty are the most important.
SST-S: small set of not very informative features. Again, it looks like flow hierarchy is the most important.
ID: extremely hard to predict, we are not able to extract any meaningful piece of information.

C.6.rER

ST-C: very large set of moderately informative features (some 20). Aperioditicy and those from matrix seem to be most informative. For predictions, aperiodicity and matdifdiag (difference of diagonal in matrix representation) are quite informative.
ST-S: large set of very informative features: flow hierarchy, ratio edges/arcs, ratio edges/arcs on undirected graph (multiple arcs collapsed), aperiodicity, matrix-related, and number of SCCs. For predictions, flow hierarchy, aperiodicty and matdifdiag are the most important.
PR-C: set of moderately informative features: flow hierarchy, ratio edges/arcs, ratio edges/arcs on undirected graph (multiple arcs collapsed), number of SCCs, and aperiodicity. In Particular, flow hierarchy seems to be the most important for predictions.
PR-S: as per before, together with some features from matrix representation. In particular, flow hierarchy seems to be the most important for predictions.
SST-C: very similar to PR-C.
SST-S: set of informative features: flow hierarchy, ratio edges/arcs, ratio edges/arcs on undirected graph (multiple arcs collapsed), number of SCCs, and aperiodicity. In particular, flow hierarchy seems to be the most important for predictions.
ID: set of informative features: flow hierarchy, ratio edges/arcs, ratio edges/arcs on undirected graph (multiple arcs collapsed), number of SCCs, and aperiodicity. In particular, flow hierarchy seems to be the most important for predictions.

C.7.rBA

ST-C: basically, as in PR-C.
PR-C: very informative small set of features: flow hierarchy, ratio edges/arcs, and number of SCCs. In Particular, flow hierarchy seems to be the most important for predictions.
SST-C: as in PR-C.

C.8.rWS

In this class we have the same behaviour for all the considered perspectives. There is a huge set of seemingly informative features, too many to list. Combined with the quite poor predictive performance, this may indicate that we do not capture the right aspect for relating the grounded extension with the set of credulously and skeptically accepted arguments w.r.t. other semantics in this set of AFs.

References

[1]	A. Barabasi and R. Albert, Emergence of scaling in random networks, Science 286: (5439) ((1999) ), 11.
[2]	P. Baroni, M. Caminada and M. Giacomin, An introduction to argumentation semantics, Knowledge Engineering Review 26: (4) ((2011) ), 365–410. doi:10.1017/S0269888911000166.
[3]	P. Baroni, F. Cerutti, P.E. Dunne and M. Giacomin, Automata for infinite argumentation structures, Artificial Intelligence 203: (0) ((2013) ), 104–150. doi:10.1016/j.artint.2013.05.002.
[4]	P. Baroni and M. Giacomin, Skepticism relations for comparing argumentation semantics, Int. J. of Approximate Reasoning 50: (6) ((2009) ), 854–866. doi:10.1016/j.ijar.2009.02.006.
[5]	M. Caminada, Semi-stable semantics, in: Proceedings of COMMA 2006, (2006) , pp. 121–130.
[6]	M. Caminada and D.M. Gabbay, A logical account of formal argumentation, Studia Logica (Special issue: new ideas in argumentation theory) 93: (2–3) ((2009) ), 109–145. doi:10.1007/s11225-009-9218-x.
[7]	F. Cerutti, M. Giacomin and M. Vallati, Algorithm selection for preferred extensions enumeration, in: Computational Models of Argument – Proceedings of COMMA 2014, Atholl Palace Hotel, Scottish Highlands, UK, September 9–12, 2014, S. Parsons, N. Oren, C. Reed and F. Cerutti, eds, Frontiers in Artificial Intelligence and Applications, Vol. 266: , IOS Press, (2014) , pp. 221–232. doi:10.3233/978-1-61499-436-7-221.
[8]	F. Cerutti, M. Giacomin and M. Vallati, Generating structured argumentation frameworks: AFBenchGen2., in: COMMA, P. Baroni, T.F. Gordon, T. Scheffler and M. Stede, eds, Frontiers in Artificial Intelligence and Applications, Vol. 287: , IOS Press, (2016) , pp. 467–468, http://dblp.uni-trier.de/db/conf/comma/comma2016.html#CeruttiGV16. ISBN 978-1-61499-686-6.
[9]	F. Cerutti, N. Oren, H. Strass, M. Thimm and M. Vallati, A benchmark framework for a computational argumentation competition, in: Proc. of COMMA, (2014) , pp. 459–460.
[10]	F. Cerutti, M. Vallati and M. Giacomin, Where are we now? State of the art and future trends of solvers for hard argumentation problems, in: Computational Models of Argument – Proceedings of COMMA 2016, Potsdam, Germany, 12–16 September, 2016, P. Baroni, T.F. Gordon, T. Scheffler and M. Stede, eds, Frontiers in Artificial Intelligence and Applications, Vol. 287: , IOS Press, (2016) , pp. 207–218. doi:10.3233/978-1-61499-686-6-207.
[11]	F. Cerutti, M. Vallati and M. Giacomin, On the impact of configuration on abstract argumentation automated reasoning, Int. J. Approx. Reason. 92: ((2018) ), 120–138. doi:10.1016/j.ijar.2017.10.002.
[12]	S.-S. Choi, S.-H. Cha and C.C. Tappert, A survey of binary similarity and distance measures, Journal of Systemics, Cybernetics and Informatics 8: (1) ((2010) ), 43–48.
[13]	D. Craandijk and F. Bex, Deep learning for abstract argumentation semantics, in: Proceedings of the 29th International Joint Conference on Artificial Intelligence and the 17th Pacific Rim International Conference on Artificial Intelligence (IJCAI-PRICAI20), (2020) .
[14]	R. Craven and F. Toni, Argument graphs and assumption-based argumentation, Artificial Intelligence 233: ((2016) ), 1–59, http://www.sciencedirect.com/science/article/pii/S0004370215001800. doi:10.1016/j.artint.2015.12.004.
[15]	M.M. Deza and E. Deza, Encyclopedia of Distances, Springer, Berlin, Heidelberg, (2014) , https://books.google.it/books?id=q_7FBAAAQBAJ. ISBN 9783662443422.
[16]	S. Doutre and J.-G. Mailly, Comparison criteria for argumentation semantics, in: Multi-Agent Systems and Agreement Technologies, F. Belardinelli and E. Argente, eds, Springer International Publishing, (2018) , pp. 219–234. doi:10.1007/978-3-030-01713-2_16.
[17]	P.M. Dung, On the acceptability of arguments and its fundamental role in nonmonotonic reasoning, logic programming, and n-person games, Artificial Intelligence 77: (2) ((1995) ), 321–357. doi:10.1016/0004-3702(94)00041-X.
[18]	P.M. Dung, P. Mancarella and F. Toni, Computing ideal sceptical argumentation, Artificial Intelligence 171: (10) ((2007) ), 642–674, Argumentation in Artificial Intelligence. doi:10.1016/j.artint.2007.05.003.
[19]	W. Dvořák and P.E. Dunne, Computational problems in formal argumentation and their complexity, in: Handbook of Formal Argumentation, P. Baroni, D. Gabbay, M. Giacomin and L. van der Torre, eds, College Publications, (2018) , Chapter 14.
[20]	P. Erdös and A. Rényi, On random graphs. I, Publ. Math-Debrecen 6: ((1959) ), 290–297.
[21]	D.A. Freedman, Statistical Models: Theory and Practice, Cambridge University Press, (2009) . doi:10.1017/CBO9780511815867.
[22]	M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann and I.H. Witten, The WEKA data mining software: An update, SIGKDD Explorations 11: (1) ((2009) ), 10–18. doi:10.1145/1656274.1656278.
[23]	K.J. Horadam and M.A. Nyblom, Distances between sets based on set commonality, Discrete Applied Mathematics 167: ((2014) ), 310–314, http://www.sciencedirect.com/science/article/pii/S0166218X13004800. doi:10.1016/j.dam.2013.10.037.
[24]	P. Jaccard, Etude de la distribution florale dans une portion des Alpes et du Jura, Bulletin de la Societe Vaudoise des Sciences Naturelles 37: ((1901) ), 547–579.
[25]	T.N. Kipf and M. Welling, Semi-supervised classification with graph convolutional networks, arXiv:1609.02907 [cs, stat], 2017.
[26]	I. Kuhlmann and M. Thimm, Using graph convolutional networks for approximate reasoning with abstract argumentation frameworks: A feasibility study, in: Proceedings of the 13th International Conference on Scalable Uncertainty Management (SUM’19), N.B. Amor, B. Quost and M. Theobald, eds, Vol. 11940: , Springer International Publishing, (2019) , pp. 24–37. doi:10.1007/978-3-030-35514-2_3.
[27]	M. Levandowsky and D. Winter, Distance between sets, Nature 234: (5323) ((1971) ), 34–35. doi:10.1038/234034a0.
[28]	L. Malmqvist, T. Yuan, P. Nigthingale and S. Manandhar, Determining the acceptability of abstract arguments with graph convolutional networks, in: Proceedings of the 3rd International Workshop on Systems and Algorithms for Formal Argumentation (SAFA’20), (2020) .
[29]	E. Marczewski and H. Steinhaus, On a certain distance of sets and the corresponding distance of functions, Colloquium Mathematicum 6: (1) ((1958) ), 319–327, http://eudml.org/doc/210378. doi:10.4064/cm-6-1-319-327.
[30]	S. Modgil and H. Prakken, The ASPIC+ framework for structured argumentation: A tutorial, Argument & Computation 5: ((2014) ), 31–62. doi:10.1080/19462166.2013.869766.
[31]	D.J. Rogers and T.T. Tanimoto, A computer program for classifying plants, Science 132: (3434) ((1960) ), 1115–1118, https://science.sciencemag.org/content/132/3434/1115. doi:10.1126/science.132.3434.1115.
[32]	A. Toniolo, T.J. Norman, A. Etuk, F. Cerutti, R.W. Ouyang, M. Srivastava, N. Oren, T. Dropps, J.A. Allen and P. Sullivan, Agent support to reasoning with different types of evidence in intelligence analysis, in: Proc. of AAMAS 2015, (2015) , pp. 781–789.
[33]	M. Vallati, F. Cerutti and M. Giacomin, Argumentation frameworks features: An initial study, in: ECAI 2014 – 21st European Conference on Artificial Intelligence, 18–22 August 2014, Prague, Czech Republic – Including Prestigious Applications of Intelligent Systems (PAIS 2014), T. Schaub, G. Friedrich and B. O’Sullivan, eds, Frontiers in Artificial Intelligence and Applications, Vol. 263: , IOS Press, (2014) , pp. 1117–1118. doi:10.3233/978-1-61499-419-0-1117.
[34]	M. Vallati, F. Cerutti and M. Giacomin, On the combination of argumentation solvers into parallel portfolios, in: AI 2017: Advances in Artificial Intelligence – 30th Australasian Joint Conference, Melbourne, VIC, Australia, August 19–20, 2017, Proceedings, W. Peng, D. Alahakoon and X. Li, eds, Lecture Notes in Computer Science, Vol. 10400: , Springer, (2017) , pp. 315–327. doi:10.1007/978-3-319-63004-5_25.
[35]	M. Vallati, F. Cerutti and M. Giacomin, Predictive models and abstract argumentation: The case of high-complexity semantics, The Knowledge Engineering Review 34: ((2019) ), e6. doi:10.1017/S0269888918000036.
[36]	B. Verheij, Two approaches to dialectical argumentation:admissible sets and argumentation stages, in: Proceedings of the Eighth Dutch Conference on Artificial Intelligence (NAIC’96), Utrecht, NL, J.-J.Ch. Meyer and L.C. van der Gaag, eds, (1996) , pp. 357–368.
[37]	D.J. Watts and S.H. Strogatz, Collective dynamics of ‘small-world’ networks., Nature 393: (6684) ((1998) ), 440–442. doi:10.1038/30918.