2017-04-07 software

stringi 1.1.5 released

Another bugfix release of the stringi package for R is on its way to CRAN. The package provides powerful string processing facilities to R users and developers and is ranked as one of the most often downloaded R extensions.


* [GENERAL] `stringi` now requires ICU4C >= 52.

* [GENERAL] `stringi` now requires R >= 2.14.

* [BUGFIX] Fixed errors pointed out by `clang-UBSAN` in `stri_brkiter.h`.

* [BUILD TIME] #238, #220: Try "standard" ICU4C build flags if a call
to `pkg-config` fails.

* [BUILD TIME] #258: Use `CXX11` instead of `CXX1X` on R >= 3.4.

* [BUILD TIME, BUGFIX] #254: `dir.exists()` is R >= 3.2.
2017-03-21 software

stringi 1.1.3 released

I have submitted a new (bugfix) release of the stringi package to CRAN.


* [REMOVE DEPRECATED] `stri_install_check()` and `stri_install_icudt()`
marked as deprecated in `stringi` 0.5-5 are no longer being exported.

* [BUGFIX] #227: Incorrect behavior of `stri_sub()` and `stri_sub<-()`
if the empty string was the result.

* [BUILD TIME] #231: The `./configure` (*NIX only) script now reads the
following environment varialbes: `STRINGI_CFLAGS`, `STRINGI_CPPFLAGS`,
see `INSTALL` for more information.

* [BUILD TIME] #253: call to `R_useDynamicSymbols` added.

* [BUILD TIME] #230: icudt is now being downloaded by
`./configure` (*NIX only) *before* building.

* [BUILD TIME] #242: `_COUNT/_LIMIT` enum constants have been deprecated
as of ICU 58.2, stringi code has been upgraded accordingly.
2017-03-15 new paper

FUZZ-IEEE'17: Two Papers Accepted

Two papers I co-author have been accepted for publication in Proceedings of the FUZZ-IEEE'17 conference that will be held in Naples, Italy.
  • Bartoszuk M., Gagolewski M., Binary aggregation functions in software plagiarism detection, In: Proc. FUZZ-IEEE'17, IEEE, 2017. (accepted for publication)
  • Cena A., Gagolewski M., OWA-based linkage and the Genie correction for hierarchical clustering, In: Proc. FUZZ-IEEE'17, IEEE, 2017. (accepted for publication)
2016-12-12 new paper

Penalty-Based Aggregation of Multidimensional Data

My paper Penalty-Based Aggregation of Multidimensional Data has been accepted for publication in Fuzzy Sets and Systems (Special Issue on Aggregation Functions).
Abstract. Research in aggregation theory is nowadays still mostly focused on algorithms to summarize tuples consisting of observations in some real interval or of diverse general ordered structures. Of course, in practice of information processing many other data types between these two extreme cases are worth inspecting. This contribution deals with the aggregation of lists of data points in Rd for arbitrary d≥1. Even though particular functions aiming to summarize multidimensional data have been discussed by researchers in data analysis, computational statistics and geometry, there is clearly a need to provide a comprehensive and unified model in which their properties like equivariances to geometric transformations, internality, and monotonicity may be studied at an appropriate level of generality. The proposed penalty-based approach serves as a common framework for all idempotent information aggregation methods, including componentwise functions, pairwise distance minimizers, and data depth-based medians. It also allows for deriving many new practically useful tools.
2016-11-21 new book

Przetwarzanie i analiza danych w języku Python

My book on Python for Data Processing and Analysis is now available in Polish book stores.
Przetwarzanie i analiza danych w języku Python - okładka
2016-11-21 new book

Programowanie w języku R (2nd Ed., revised and extended)

The 2nd edition of my R Programming Book is now available in Polish book stores.
Programowanie w języku R - okładka

Eusflat'17 Special Session:
Algorithms for Data Aggregation and Fusion

Call for contributions – EUSFLAT 2017 (10th Conference of the European Society for Fuzzy Logic and Technology, Warsaw, Poland) Special Session Algorithms for Data Aggregation and Fusion; for more details, click here.
2016-10-27 new paper

Penalty-Based and Other Representations of Economic Inequality

My paper with Gleb Beliakov and Simon James, entitled Penalty-based and other representations of economic inequality, has been accepted for publication in International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems today.
Abstract. Economic inequality measures are employed as a key component in various socio-demographic indices to capture the disparity between the wealthy and poor. Since their inception, they have also been used as a basis for modelling spread and disparity in other contexts. While recent research has identified that a number of classical inequality and welfare functions can be considered in the framework of OWA operators, here we propose a framework of penalty-based aggregation functions and their associated penalties as measures of inequality.