(Pronounced like Mark Gaggle-Eve-Ski) 🙃🏷 Senior Lecturer in Applied Artificial Intelligence
Researcher in the Science of Data (with particular emphasis on modelling of complex phenomena and developing of usable, general purpose algorithms)
Free (Libre) and Open Source Data Analysis Software Developer
Data Science, Machine Learning, and Statistical Computing Tutor & Trainer
stringx: Drop-in replacements for base R string functions powered by stringi
English is the native language for only 5% of the World population.
Also, only 17% of us can understand this text. Moreover, the Latin alphabet
is the main one for merely 36% of the total. The early computer era, now a
very long time ago, was dominated by the US. Due to the proliferation of
the internet, smartphones, social media, and other technologies and
communication platforms, this is no longer the case.
package replaces base R string functions (such as grep(), tolower(),
and sprintf()) with ones that fully support the Unicode standards related
to natural language processing, fixes some long-standing inconsistencies,
and introduces some new, useful features. Thanks to
ICU (International Components for
they are fast, reliable, and portable across different platforms.
Now available from CRAN.
realtest: When Expectations Meet Reality: Realistic Unit Testing in R
is a framework for unit testing for realistic minimalists, where we distinguish
between expected, acceptable, current, fallback, ideal, or regressive behaviour.
It can also be used for monitoring other software projects for changes.
Now available on CRAN.
A maintenance release of the Python and R
for fast and robust hierarchical clustering with noise point detection
is now available on PyPI and CRAN.
On the aggregation of compositional data
Raul Pérez-Fernández, Bernard De Baets and I have a new paper accepted for publication in Information Fusion; abstract in the sequel. Read more…