2021-10-08 - Homepage Rewrite
My website is now generated with Sphinx.
2021-10-06 - Are Cluster Validity Measures (In)valid?
To appear in Information Sciences — a paper coauthored by Maciek Bartoszuk and Ania Cena (doi:10.1016/j.ins.2021.10.004).
2021-09-27 - Paper on stringi
A paper on my stringi package has been accepted for publication in Journal of Statistical Software.
A new paper by Maciek Bartoszuk and me is to appear in Knowledge-Based Systems (doi:10.1016/j.knosys.2021.107427).
English is the native language for only 5% of the World population. Also, only 17% of us can understand this text. Moreover, the Latin alphabet is the main one for merely 36% of the total. The early computer era, now a very long time ago, was dominated by the US. Due to the proliferation of the internet, smartphones, social media, and other technologies and communication platforms, this is no longer the case. The stringx package replaces base R string functions (such as
sprintf()) with ones that fully support the Unicode standards related to natural language processing, fixes some long-standing inconsistencies, and introduces some new, useful features. Thanks to ICU (International Components for Unicode) and stringi, they are fast, reliable, and portable across different platforms. Now available from CRAN.
2021-07-14 - stringi 1.7.2
Another major update of stringi brings a rewritten version of
stri_sprintf, support for custom rule-based transliteration, extraction of named regex capture groups, and many other enhancements.
2021-06-17 - realtest 0.2.1 on CRAN
An update to realtest is now available.
realtest is a framework for unit testing for realistic minimalists, where we distinguish between expected, acceptable, current, fallback, ideal, or regressive behaviour. It can also be used for monitoring other software projects for changes. Now available on CRAN.
2021-05-27 - Paper on the genieclust Python+R package