2019-11-14 new paper

Robust fitting for the Sugeno integral with respect to general fuzzy measures

The editor of Information Sciences have just let us know that a paper by Gleb Beliakov, Simon James and me will be published in this outlet.

Abstract. The Sugeno integral is an expressive aggregation function with potential applications across a range of decision contexts. Its calculation requires only the lattice minimum and maximum operations, making it particularly suited to ordinal data and robust to scale transformations. However, for practical use in data analysis and prediction, we require efficient methods for learning the associated fuzzy measure. While such methods are well developed for the Choquet integral, the fitting problem is more difficult for the Sugeno integral because it is not amenable to being expressed as a linear combination of weights, and more generally due to plateaus and non-differentiability in the objective function. Previous research has hence focused on heuristic approaches or simplified fuzzy measures. Here we show that the problem of fitting the Sugeno integral to data such that the maximum absolute error is minimized can be solved using an efficient bilevel program. This method can be incorporated into algorithms that learn fuzzy measures with the aim of minimizing the median residual. This equips us with tools that make the Sugeno integral a feasible option in robust data regression and analysis. We provide experimental comparison with a genetic algorithms approach and an example in data analysis.


Deakin University

On 23rd of September 2019 I commence as a Senior Lecturer in Applied Artificial Intelligence at Deakin University in Melbourne-Burwood, Australia (Australian senior lecturer is supposed to be equivalent to an associate professor in the US).
2019-09-10 new paper

Constrained Ordered Weighted averaging aggregation with multiple comonotone constraints

Lucian Coroianu, Robert Fullér, Simon James and I got a paper accepted in the Fuzzy Sets and Systems outlet. Abstract below.

Abstract. The constrained ordered weighted averaging (OWA) aggregation problem arises when we aim to maximize or minimize a convex combination of order statistics under linear inequality constraints that act on the variables with respect to their original sources. The standalone approach to optimizing the OWA under constraints is to consider all permutations of the inputs, which becomes quickly infeasible when there are more than a few variables, however in certain cases we can take advantage of the relationships amongst the constraints and the corresponding solution structures. For example, we can consider a land-use allocation satisfaction problem with an auxiliary aim of balancing land-types, whereby the response curves for each species are non-decreasing with respect to the land-types. This results in comonotone constraints, which allow us to drastically reduce the complexity of the problem.
In this paper, we show that if we have an arbitrary number of constraints that are comonotone (i.e., they share the same ordering permutation of the coefficients), then the optimal solution occurs for decreasing components of the solution. After investigating the form of the solution in some special cases and providing theoretical results that shed light on the form of the solution, we detail practical approaches to solving and give real-world examples.

2019-06-17 new PhD

Jan Lasek's PhD defence

My PhD student, Jan Lasek, has successfully defended his doctoral thesis, New data-driven rating systems for association football. :)
2019-06-08 new paper

Aggregation on ordinal scales with the Sugeno integral for biomedical applications

Gleb Beliakov, Simon James and I got another paper accepted for publication in Information Sciences. This time we re-write a learning-to-aggregate problem based on the Sugeno integral in a difference-of-convex objective setting. The derived tool is particularly useful when working with ordinal data.

Abstract. The Sugeno integral is a function particularly suited to the aggregation of ordinal inputs. Defined with respect to a fuzzy measure, its ability to account for complementary and redundant relationships between variables brings much potential to the field of biomedicine, where it is common for measurements and patient information to be expressed qualitatively. However, practical applications require well-developed methods for identifying the Sugeno integral's parameters, and this task is not easily expressed using the standard optimisation approaches. Here we formulate the objective function as the difference of two convex functions, which enables the use of specialised numerical methods. Such techniques are compared with other global optimisation frameworks through a number of numerical experiments.

2019-05-14 new paper

New paper on information fusion

A taxonomy of monotonicity properties for the aggregation of multidimensional data – joint work with Raúl Pérez-Fernández and Bernard De Baets has been accepted for publication in Information Fusion.

Abstract. The property of monotonicity, which requires a function to preserve a given order, has been considered the standard in the aggregation of real numbers for decades. In this paper, we argue that, for the case of multidimensional data, an order-based definition of monotonicity is far too restrictive. We propose several meaningful alternatives to this property not involving the preservation of a given order by returning to its early origins stemming from the field of calculus. Numerous aggregation methods for multidimensional data commonly used by practitioners are studied within our new framework.

2019-03-25 new paper

An inherent difficulty in the aggregation of multidimensional data

Accepted for publication in IEEE Transactions on Fuzzy SystemsAn inherent difficulty in the aggregation of multidimensional data by Raúl Pérez-Fernández, Bernard De Baets, and me. Abstract below.

Abstract. In the field of information fusion, the problem of data aggregation has been formalized as an order-preserving process that builds upon the property of monotonicity. However, fields such as computational statistics, data analysis and geometry, usually emphasize the role of equivariances to various geometrical transformations in aggregation processes. Admittedly, if we consider a unidimensional data fusion task, both requirements are often compatible with each other. Nevertheless, in this paper we show that, in the multidimensional setting, the only idempotent functions that are monotone and orthogonal equivariant are the over-simplistic weighted centroids. Even more, this result still holds after replacing monotonicity and orthogonal equivariance by the weaker property of orthomonotonicity. This implies that the aforementioned approaches to the aggregation of multidimensional data are irreconcilable, and that, if a weighted centroid is to be avoided, we must choose between monotonicity and a desirable behaviour with regard to orthogonal transformations.

2019-03-13 new paper

Should we introduce a dislike button for academic papers?

The abstract of our (by Agnieszka Geras, Grzegorz Siudem, and me) recent paper to be published in Journal of the Association for Information Science and Technology can be found below.

Abstract. On the grounds of the revealed, mutual resemblance between the behaviour of users of Stack Exchange and the dynamics of the citations accumulation process in the scientific community, we tackled an outwardly intractable problem of assessing the impact of introducing "negative" citations.
Although the most frequent reason to cite a paper is to highlight the connection between the two publications, researchers sometimes mention an earlier work to cast a negative light. While computing citation-based scores, for instance the h-index, information about the reason why a paper was mentioned is neglected. Therefore it can be questioned whether these indices describe scientific achievements accurately.
In this contribution we shed insight into the problem of "negative" citations, analysing data from Stack Exchange and, to draw more universal conclusions, we derive an approximation of citations scores. Here we show that the quantified influence of introducing negative citations is of lesser importance and that they could be used as an indicator of where attention of scientific community is allocated.