https://www.gagolewski.comBlog2024-03-07T02:43:07.533125+00:00ABloghttps://www.gagolewski.com/_news/20240108-paper-lorenz-gini.htmlGini-stable Lorenz curves and their relation to the generalised Pareto distribution2024-01-08T00:00:00+11:00<p class="ablog-post-excerpt"><p><em>Journal of Informetrics</em> will publish a new paper of ours
(by Lucio Bertoli-Barsotti, Grzegorz Siudem, Barbara Żogała-Siudem, and yours truly);
(DOI:<a class="reference external" href="https://dx.doi.org/10.1016/j.joi.2024.101499">10.1016/j.joi.2024.101499</a>).</p>
</p>
Journal of Informetrics will publish a new paper of ours
(by Lucio Bertoli-Barsotti, Grzegorz Siudem, Barbara Żogała-Siudem, and yours truly);
(DOI:10.1016/j.joi.2024.101499).2024-01-08T00:00:00+11:00https://www.gagolewski.com/_news/20240104-paper-randomfn.htmlRandom generation of linearly constrained fuzzy measures and domain coverage performance evaluation2024-01-04T00:00:00+11:00<p class="ablog-post-excerpt"><p>Jian-Zhang Wu, Gleb Beliakov, Simon James, and I published a new paper in <em>Information Sciences</em>
(DOI:<a class="reference external" href="https://dx.doi.org/10.1016/j.ins.2023.120080">10.1016/j.ins.2023.120080</a>).</p>
</p>
Jian-Zhang Wu, Gleb Beliakov, Simon James, and I published a new paper in Information Sciences
(DOI:10.1016/j.ins.2023.120080).2024-01-04T00:00:00+11:00https://www.gagolewski.com/_news/20231109-software-stringi-1-8-1.htmlstringi 1.8.1 on CRAN2023-11-09T00:00:00+11:00<p class="ablog-post-excerpt"><p>A new major release of <a class="reference external" href="https://stringi.gagolewski.com"><strong><code class="docutils literal notranslate"><span class="pre">stringi</span></code></strong></a>
was submitted to <a class="reference external" href="https://cran.r-project.org/package=stringi">CRAN</a>.</p>
</p>
A new major release of stringi
was submitted to CRAN.2023-11-09T00:00:00+11:00https://www.gagolewski.com/_news/20231003-paper-owalink.htmlHierarchical Clustering with OWA-based Linkages, the Lance-Williams Formula, and Dendrogram Inversions2023-10-03T00:00:00+11:00<p class="ablog-post-excerpt"><p>A paper by Anna Cena, Simon James, Gleb Beliakov, and I entitled
<em>Hierarchical Clustering with OWA-based Linkages, the Lance-Williams Formula, and Dendrogram Inversions</em>
has been accepted for publication in <em>Fuzzy Sets and Systems</em>
(DOI:<a class="reference external" href="https://dx.doi.org/10.1016/j.fss.2023.108740">10.1016/j.fss.2023.108740</a>).
A preprint is available on <a class="reference external" href="https://arxiv.org/abs/2303.05683">arXiv</a>.</p>
</p>
A paper by Anna Cena, Simon James, Gleb Beliakov, and I entitled
Hierarchical Clustering with OWA-based Linkages, the Lance-Williams Formula, and Dendrogram Inversions
has been accepted for publication in Fuzzy Sets and Systems
(DOI:10.1016/j.fss.2023.108740).
A preprint is available on arXiv.2023-10-03T00:00:00+11:00https://www.gagolewski.com/_news/20231001-submitted-nca.htmlSubmitted: Normalised Clustering Accuracy2023-10-01T00:00:00+10:00<p class="ablog-post-excerpt"><p>An revised version of a paper
<em>Normalised Clustering Accuracy: An Asymmetric External Cluster Validity Measure</em>
(in a previous draft called <em>Adjusted Asymmetric Accuracy</em>)
is now available on <a class="reference external" href="https://arxiv.org/abs/2209.02935">arXiv</a>.</p>
</p>
An revised version of a paper
Normalised Clustering Accuracy: An Asymmetric External Cluster Validity Measure
(in a previous draft called Adjusted Asymmetric Accuracy)
is now available on arXiv.2023-10-01T00:00:00+10:00https://www.gagolewski.com/_news/20230628-deepr.htmlDeep R Programming v1.0.02023-06-28T00:00:00+10:00<p class="ablog-post-excerpt"><p>Final version of <em>Deep R Programming</em> is now available.</p>
</p>
Final version of Deep R Programming is now available.2023-06-28T00:00:00+10:00https://www.gagolewski.com/_news/20230418-submitted-gini-lorenz.htmlSubmitted: Gini-stable Lorenz curves and their relation to the generalised Pareto distribution2023-04-18T00:00:00+10:00<p class="ablog-post-excerpt"><p>A preprint of our (Lucio Bertoli-Barsotti, Marek Gagolewski, Grzegorz Siudem, and Barbara Żogała-Siudem) new contribution
<em>Gini-stable Lorenz curves and their relation to the generalised Pareto distribution</em>
is now available on <a class="reference external" href="https://arxiv.org/abs/2304.07480">arXiv</a>.</p>
</p>
A preprint of our (Lucio Bertoli-Barsotti, Marek Gagolewski, Grzegorz Siudem, and Barbara Żogała-Siudem) new contribution
Gini-stable Lorenz curves and their relation to the generalised Pareto distribution
is now available on arXiv.2023-04-18T00:00:00+10:00https://www.gagolewski.com/_news/20230418-submitted-equivalence-inequality.htmlSubmitted: Equivalence of inequality indices: Three dimensions of impact revisited2023-04-18T00:00:00+10:00<p class="ablog-post-excerpt"><p>A preprint of our (Lucio Bertoli-Barsotti, Marek Gagolewski, Grzegorz Siudem, and Barbara Żogała-Siudem) new contribution
<em>Equivalence of inequality indices: Three dimensions of impact revisited</em>
is now available on <a class="reference external" href="https://arxiv.org/abs/2304.07479">arXiv</a>.</p>
</p>
A preprint of our (Lucio Bertoli-Barsotti, Marek Gagolewski, Grzegorz Siudem, and Barbara Żogała-Siudem) new contribution
Equivalence of inequality indices: Three dimensions of impact revisited
is now available on arXiv.2023-04-18T00:00:00+10:00https://www.gagolewski.com/_news/20230323-submitted-genie-graphs.htmlSubmitted: Community Detection in Complex Networks2023-03-23T00:00:00+11:00<p class="ablog-post-excerpt"><p>An early version of my most recent paper
<em>Community detection in complex networks via node similarity, graph representation learning, and hierarchical clustering</em>
is now available on <a class="reference external" href="https://arxiv.org/abs/2303.12212">arXiv</a>.</p>
</p>
An early version of my most recent paper
Community detection in complex networks via node similarity, graph representation learning, and hierarchical clustering
is now available on arXiv.2023-03-23T00:00:00+11:00https://www.gagolewski.com/_news/20230310-submitted-clumst.htmlSubmitted: Clustering with minimum spanning trees: How good can it be?2023-03-10T00:00:00+11:00<p class="ablog-post-excerpt"><p>An early version of my most recent paper
<em>Clustering with minimum spanning trees: How good can it be?</em>
is now available on <a class="reference external" href="https://arxiv.org/abs/2303.05679">arXiv</a>.</p>
</p>
An early version of my most recent paper
Clustering with minimum spanning trees: How good can it be?
is now available on arXiv.2023-03-10T00:00:00+11:00https://www.gagolewski.com/_news/20230127-paper-fss-benchmark-integral.htmlA Benchmark-type Generalisation of the Sugeno Integral with Applications in Bibliometrics2023-01-27T00:00:00+11:00<p class="ablog-post-excerpt"><p>New paper by Michał Boczek, Marek Kaluszka, Andrzej Okolewski, and yours truly
to appear in <em>Fuzzy Sets and Systems</em>
(DOI: <a class="reference external" href="https://doi.org/10.1016/j.fss.2023.01.014">10.1016/j.fss.2023.01.014</a>).</p>
</p>
New paper by Michał Boczek, Marek Kaluszka, Andrzej Okolewski, and yours truly
to appear in Fuzzy Sets and Systems
(DOI: 10.1016/j.fss.2023.01.014).2023-01-27T00:00:00+11:00https://www.gagolewski.com/_news/20221228-deepr-draft.htmlDeep R Programming (First Draft)2022-12-28T00:00:00+11:00<p class="ablog-post-excerpt"><p>I’ve released an early draft of my new textbook
<em>Deep R Programming</em> – the first 12 chapters.
It is a comprehensive course on one of the most popular
languages in data science (statistical computing, graphics,
machine learning, data wrangling and analytics). It introduces
the base language in-depth and is aimed at ambitious students,
practitioners, and researchers who would like to become independent
users of this powerful environment.</p>
</p>
I’ve released an early draft of my new textbook
Deep R Programming – the first 12 chapters.
It is a comprehensive course on one of the most popular
languages in data science (statistical computing, graphics,
machine learning, data wrangling and analytics). It introduces
the base language in-depth and is aimed at ambitious students,
practitioners, and researchers who would like to become independent
users of this powerful environment.2022-12-28T00:00:00+11:00https://www.gagolewski.com/_news/20220921-paper-clustering-benchmarks.htmlA Framework for Benchmarking Clustering Algorithms2022-11-16T00:00:00+11:00<p class="ablog-post-excerpt"><p>A paper related to my framework for benchmarking clustering algorithms
will appear in <em>SoftwareX</em>
(DOI: <a class="reference external" href="https://doi.org/10.1016/j.softx.2022.101270">10.1016/j.softx.2022.101270</a>).
Its preprint is available on <a class="reference external" href="https://arxiv.org/abs/2209.09493">arXiv</a>.
The project also has a dedicated website:
<a class="reference external" href="https://clustering-benchmarks.gagolewski.com">https://clustering-benchmarks.gagolewski.com</a>.</p>
</p>
A paper related to my framework for benchmarking clustering algorithms
will appear in SoftwareX
(DOI: 10.1016/j.softx.2022.101270).
Its preprint is available on arXiv.
The project also has a dedicated website:
https://clustering-benchmarks.gagolewski.com.2022-11-16T00:00:00+11:00https://www.gagolewski.com/_news/20221115-paper-joi-interpretable-citation-models.htmlInterpretable Reparameterisations of Citation Models2022-11-15T00:00:00+11:00<p class="ablog-post-excerpt"><p>To be published in <em>Journal of Informetrics</em>: a new paper
by Barbara Żogała-Siudem, Anna Cena, Greg Siudem, and I
(DOI: <a class="reference external" href="https://doi.org/10.1016/j.joi.2022.101355">10.1016/j.joi.2022.101355</a>).</p>
</p>
To be published in Journal of Informetrics: a new paper
by Barbara Żogała-Siudem, Anna Cena, Greg Siudem, and I
(DOI: 10.1016/j.joi.2022.101355).2022-11-15T00:00:00+11:00https://www.gagolewski.com/_news/20221014-paper-joi-journals.htmlAccidentality in Journal Citation Patterns2022-10-14T00:00:00+11:00<p class="ablog-post-excerpt"><p>Maciej J. Mrowiński, Grzesiek Siudem, and I will have another contribution in
the <em>Journal of Informetrics</em>
(DOI: <a class="reference external" href="https://doi.org/10.1016/j.joi.2022.101341">10.1016/j.joi.2022.101341</a>).</p>
</p>
Maciej J. Mrowiński, Grzesiek Siudem, and I will have another contribution in
the Journal of Informetrics
(DOI: 10.1016/j.joi.2022.101341).2022-10-14T00:00:00+11:00https://www.gagolewski.com/_news/20220905-software-genieclust-1-1-0.htmlgenieclust 1.1.0 on PyPI and CRAN2022-09-05T00:00:00+10:00<p class="ablog-post-excerpt"><p>A new release of the <a class="reference external" href="https://genieclust.gagolewski.com"><strong><code class="docutils literal notranslate"><span class="pre">genieclust</span></code></strong></a>
package is available on <a class="reference external" href="https://pypi.org/project/genieclust/">PyPI</a>
and <a class="reference external" href="https://cran.r-project.org/web/packages/genieclust/">CRAN</a>.</p>
</p>
A new release of the genieclust
package is available on PyPI
and CRAN.2022-09-05T00:00:00+10:00https://www.gagolewski.com/_news/20220825-datawranglingpy-amazon.htmlMinimalist Data Wrangling with Python – Paperback Available2022-08-24T00:00:00+10:00<p class="ablog-post-excerpt"><p>A printed version of my open-access textbook
<a class="reference external" href="https://datawranglingpy.gagolewski.com/">Minimalist Data Wrangling with Python</a>
can now be ordered from <a class="reference external" href="https://www.amazon.com/dp/0645571911">Amazon</a>.
It is exactly the same as the freely available
<a class="reference external" href="https://datawranglingpy.gagolewski.com/datawranglingpy.pdf">PDF version</a>.</p>
</p>
A printed version of my open-access textbook
Minimalist Data Wrangling with Python
can now be ordered from Amazon.
It is exactly the same as the freely available
PDF version.2022-08-24T00:00:00+10:00https://www.gagolewski.com/_news/20220810-paper-physicaa-pricepareto2.htmlPower Laws, the Price Model, and the Pareto type-2 Distribution2022-08-10T00:00:00+10:00<p class="ablog-post-excerpt"><p>A new contribution of ours (with Grzesiek Siudem and Przemysław Nowak)
will appear in <em>Physica A: Statistical Mechanics and its Applications</em>
(<a class="reference external" href="https://arxiv.org/abs/2201.11456">preprint</a>;
(DOI: <a class="reference external" href="https://doi.org/10.1016/j.physa.2022.128059">10.1016/j.physa.2022.128059</a>).</p>
</p>
A new contribution of ours (with Grzesiek Siudem and Przemysław Nowak)
will appear in Physica A: Statistical Mechanics and its Applications
(preprint;
(DOI: 10.1016/j.physa.2022.128059).2022-08-10T00:00:00+10:00https://www.gagolewski.com/_news/20220808-software-genieclust-1-0-1.htmlgenieclust 1.0.1 on PyPI and CRAN2022-08-08T00:00:00+10:00<p class="ablog-post-excerpt"><p>A new release of <a class="reference external" href="https://genieclust.gagolewski.com"><strong><code class="docutils literal notranslate"><span class="pre">genieclust</span></code></strong></a>
has been published on <a class="reference external" href="https://pypi.org/project/genieclust/">PyPI</a>
and <a class="reference external" href="https://cran.r-project.org/web/packages/genieclust/">CRAN</a>.</p>
</p>
A new release of genieclust
has been published on PyPI
and CRAN.2022-08-08T00:00:00+10:00https://www.gagolewski.com/_news/20220716-datawranglingpy.htmlMinimalist Data Wrangling with Python2022-07-16T00:00:00+10:00<p class="ablog-post-excerpt"><p>I’ve completed a
<a class="reference external" href="https://datawranglingpy.gagolewski.com/">textbook</a>
on data wrangling with Python.
This work is, and will remain, available for everyone’s enjoyment,
because I believe that education should be free for all.
Just like open-source software, more open-access textbooks
are urgently needed. Free == independent == higher quality.</p>
</p>
I’ve completed a
textbook
on data wrangling with Python.
This work is, and will remain, available for everyone’s enjoyment,
because I believe that education should be free for all.
Just like open-source software, more open-access textbooks
are urgently needed. Free == independent == higher quality.2022-07-16T00:00:00+10:00https://www.gagolewski.com/_news/20220711-software-stringi-1-7-8.htmlstringi 1.7.8 on CRAN2022-07-11T00:00:00+10:00<p class="ablog-post-excerpt"><p>A maintenance release of <a class="reference external" href="https://stringi.gagolewski.com"><strong><code class="docutils literal notranslate"><span class="pre">stringi</span></code></strong></a>
is now available on <a class="reference external" href="https://cran.r-project.org/web/packages/stringi/">CRAN</a>.</p>
</p>
A maintenance release of stringi
is now available on CRAN.2022-07-11T00:00:00+10:00https://www.gagolewski.com/_news/20220702-paper-fss-antibuoyant.htmlReduction of Variables and Constraints in Fitting Antibuoyant Fuzzy Measures to Data Using Linear Programming2022-07-02T00:00:00+10:00<p class="ablog-post-excerpt"><p>Gleb Beliakov, Simon James, and I will have another paper published in
<em>Fuzzy Sets and Systems</em>
(DOI: <a class="reference external" href="https://doi.org/10.1016/j.fss.2022.06.025">10.1016/j.fss.2022.06.025</a>).</p>
</p>
Gleb Beliakov, Simon James, and I will have another paper published in
Fuzzy Sets and Systems
(DOI: 10.1016/j.fss.2022.06.025).2022-07-02T00:00:00+10:00https://www.gagolewski.com/_news/20220504-paper-jasist-timetovote.htmlTime to Vote: Temporal Clustering of User Activity on Stack Overflow2022-05-04T00:00:00+10:00<p class="ablog-post-excerpt"><p>A new paper of mine (coauthors: Agnieszka Geras, Grzesiek Siudem)
will appear in the
<em>Journal of the Association for Information Science and Technology</em>
(DOI: <a class="reference external" href="https://doi.org/10.1002/asi.24658">10.1002/asi.24658</a>).</p>
</p>
A new paper of mine (coauthors: Agnieszka Geras, Grzesiek Siudem)
will appear in the
Journal of the Association for Information Science and Technology
(DOI: 10.1002/asi.24658).2022-05-04T00:00:00+10:00https://www.gagolewski.com/_news/20220315-paper-scientometrics-ockham.htmlOckham’s Index of Scientific Impact2022-03-15T00:00:00+11:00<p class="ablog-post-excerpt"><p>A new paper of mine (co-authored by Basia Żogała-Siudem,
Grzesiek Siudem, and Ania Cena) will appear in <em>Scientometrics</em>
(DOI: <a class="reference external" href="https://doi.org/10.1007/s11192-022-04345-2">10.1007/s11192-022-04345-2</a>)</p>
</p>
A new paper of mine (co-authored by Basia Żogała-Siudem,
Grzesiek Siudem, and Ania Cena) will appear in Scientometrics
(DOI: 10.1007/s11192-022-04345-2)2022-03-15T00:00:00+11:00https://www.gagolewski.com/_news/20220226-paper-joi-validcit.htmlValidating Citation Models by Proxy Indices2022-02-26T00:00:00+11:00<p class="ablog-post-excerpt"><p>Accepted for publication in
<em>Journal of Informetrics</em>: a new paper by Ania Cena, Basia Żogała-Siudem,
Grzesiek Siudem, and yours truly
(DOI: <a class="reference external" href="https://doi.org/10.1016/j.joi.2022.101267">10.1016/j.joi.2022.101267</a>).</p>
</p>
Accepted for publication in
Journal of Informetrics: a new paper by Ania Cena, Basia Żogała-Siudem,
Grzesiek Siudem, and yours truly
(DOI: 10.1016/j.joi.2022.101267).2022-02-26T00:00:00+11:00https://www.gagolewski.com/_news/20220219-award-ministry.htmlMinistry of Education and Science Award2022-02-19T00:00:00+11:00<p class="ablog-post-excerpt"><p>Together with a number of excellent colleagues,
I have <a class="reference external" href="https://www.gov.pl/web/edukacja-i-nauka/nagrody-ministra-edukacji-i-nauki--serdecznie-gratulujemy">received</a>
the Ministry of Education and Science, Poland,
award for significant achievements
in teaching, for the design and implementation of a new innovative
course of study – Master of Data Science –
at the Faculty of Mathematics and Information Science,
Warsaw University of Technology.</p>
</p>
Together with a number of excellent colleagues,
I have received
the Ministry of Education and Science, Poland,
award for significant achievements
in teaching, for the design and implementation of a new innovative
course of study – Master of Data Science –
at the Faculty of Mathematics and Information Science,
Warsaw University of Technology.2022-02-19T00:00:00+11:00https://www.gagolewski.com/_news/20220204-fsta-plenary.htmlInvited Lecture at FSTA 20222022-02-04T00:00:00+11:00<p class="ablog-post-excerpt"><p>I’m giving (online…) an invited lecture at <a class="reference external" href="http://www.fsta.sk/index.html">FSTA 2022</a>
today entitled <em>Clustering and aggregation</em>,
where we will examine a few scenarios where aggregation methods
can aid in the construction, analysis, and evaluation of tools related to data
clustering, including linkage criteria, partition similarity measures,
and cluster validity indices. We’ll also indicate some noteworthy challenges
for both theoretical and practical future research endeavours.</p>
</p>
I’m giving (online…) an invited lecture at FSTA 2022
today entitled Clustering and aggregation,
where we will examine a few scenarios where aggregation methods
can aid in the construction, analysis, and evaluation of tools related to data
clustering, including linkage criteria, partition similarity measures,
and cluster validity indices. We’ll also indicate some noteworthy challenges
for both theoretical and practical future research endeavours.2022-02-04T00:00:00+11:00https://www.gagolewski.com/_news/20211008-homepage-rewrite.htmlHomepage Rewrite2021-10-08T00:00:00+11:00<p class="ablog-post-excerpt"><p>My website is now generated with <a class="reference external" href="https://www.sphinx-doc.org/">Sphinx</a>.</p>
</p>
My website is now generated with Sphinx.2021-10-08T00:00:00+11:00https://www.gagolewski.com/_news/20211006-paper-cvi.htmlAre Cluster Validity Measures (In)valid?2021-10-06T00:00:00+11:00<p class="ablog-post-excerpt"><p>To appear in
<em>Information Sciences</em> — a paper
coauthored by Maciek Bartoszuk and Ania Cena
(doi:<a class="reference external" href="https://dx.doi.org/10.1016/j.ins.2021.10.004">10.1016/j.ins.2021.10.004</a>).</p>
</p>
To appear in
Information Sciences — a paper
coauthored by Maciek Bartoszuk and Ania Cena
(doi:10.1016/j.ins.2021.10.004).2021-10-06T00:00:00+11:00https://www.gagolewski.com/_news/20210927-paper-stringi.htmlPaper on stringi2021-09-27T00:00:00+10:00<p class="ablog-post-excerpt"><p>A paper on my <a class="reference external" href="https://stringi.gagolewski.com"><strong><code class="docutils literal notranslate"><span class="pre">stringi</span></code></strong></a>
package has been accepted for publication in
<em>Journal of Statistical Software</em>
(<a class="reference external" href="https://dx.doi.org/10.18637/jss.v103.i02">doi:10.18637/jss.v103.i02</a>.</p>
</p>
A paper on my stringi
package has been accepted for publication in
Journal of Statistical Software
(doi:10.18637/jss.v103.i02.2021-09-27T00:00:00+10:00https://www.gagolewski.com/_news/20210826-paper-tnorm-similar.htmlT-norms or t-conorms? How to aggregate similarity degrees for plagiarism detection2021-08-26T00:00:00+10:00<p class="ablog-post-excerpt"><p>A new paper by Maciek Bartoszuk and me is to appear in <em>Knowledge-Based Systems</em>
(doi:<a class="reference external" href="https://dx.doi.org/10.1016/j.knosys.2021.107427">10.1016/j.knosys.2021.107427</a>).</p>
</p>
A new paper by Maciek Bartoszuk and me is to appear in Knowledge-Based Systems
(doi:10.1016/j.knosys.2021.107427).2021-08-26T00:00:00+10:00https://www.gagolewski.com/_news/20210729-software-stringx.htmlstringx: Drop-in replacements for base R string functions powered by stringi2021-07-29T00:00:00+10:00<p class="ablog-post-excerpt"><p>English is the native language for only 5% of the World population.
Also, only 17% of us can understand this text. Moreover, the Latin alphabet
is the main one for merely 36% of the total. The early computer era, now a
very long time ago, was dominated by the US. Due to the proliferation of
the internet, smartphones, social media, and other technologies and
communication platforms, this is no longer the case.
The <a class="reference external" href="https://stringx.gagolewski.com"><strong><code class="docutils literal notranslate"><span class="pre">stringx</span></code></strong></a>
package replaces base R string functions (such as <code class="docutils literal notranslate"><span class="pre">grep()</span></code>, <code class="docutils literal notranslate"><span class="pre">tolower()</span></code>,
and <code class="docutils literal notranslate"><span class="pre">sprintf()</span></code>) with ones that fully support the Unicode standards related
to natural language processing, fixes some long-standing inconsistencies,
and introduces some new, useful features. Thanks to
<a class="reference external" href="http://site.icu-project.org">ICU</a> (International Components for Unicode)
and <a class="reference external" href="https://stringi.gagolewski.com"><strong><code class="docutils literal notranslate"><span class="pre">stringi</span></code></strong></a>,
they are fast, reliable, and portable across different platforms.
Now available from <a class="reference external" href="https://CRAN.R-project.org/package=stringx">CRAN</a>.</p>
</p>
English is the native language for only 5% of the World population.
Also, only 17% of us can understand this text. Moreover, the Latin alphabet
is the main one for merely 36% of the total. The early computer era, now a
very long time ago, was dominated by the US. Due to the proliferation of
the internet, smartphones, social media, and other technologies and
communication platforms, this is no longer the case.
The stringx
package replaces base R string functions (such as grep(), tolower(),
and sprintf()) with ones that fully support the Unicode standards related
to natural language processing, fixes some long-standing inconsistencies,
and introduces some new, useful features. Thanks to
ICU (International Components for Unicode)
and stringi,
they are fast, reliable, and portable across different platforms.
Now available from CRAN.2021-07-29T00:00:00+10:00https://www.gagolewski.com/_news/20210714-software-stringi-1-7-2.htmlstringi 1.7.22021-07-14T00:00:00+10:00<p class="ablog-post-excerpt"><p>Another major update of
<a class="reference external" href="https://stringi.gagolewski.com"><strong><code class="docutils literal notranslate"><span class="pre">stringi</span></code></strong></a>
brings a rewritten version of <code class="docutils literal notranslate"><span class="pre">stri_sprintf</span></code>,
support for custom rule-based transliteration,
extraction of named regex capture groups,
and many other enhancements.</p>
</p>
Another major update of
stringi
brings a rewritten version of stri_sprintf,
support for custom rule-based transliteration,
extraction of named regex capture groups,
and many other enhancements.2021-07-14T00:00:00+10:00https://www.gagolewski.com/_news/20210617-software-realtest-0-2-1.htmlrealtest 0.2.1 on CRAN2021-06-17T00:00:00+10:00<p class="ablog-post-excerpt"><p>An update to <a class="reference external" href="https://realtest.gagolewski.com"><strong><code class="docutils literal notranslate"><span class="pre">realtest</span></code></strong></a>
is now available.</p>
</p>
An update to realtest
is now available.2021-06-17T00:00:00+10:00https://www.gagolewski.com/_news/20210604-software-realtest.htmlrealtest: When Expectations Meet Reality: Realistic Unit Testing in R2021-06-04T00:00:00+10:00<p class="ablog-post-excerpt"><p><a class="reference external" href="https://realtest.gagolewski.com"><strong><code class="docutils literal notranslate"><span class="pre">realtest</span></code></strong></a>
is a framework for unit testing for realistic minimalists, where we distinguish
between expected, acceptable, current, fallback, ideal, or regressive behaviour.
It can also be used for monitoring other software projects for changes.
Now available on <a class="reference external" href="https://CRAN.R-project.org/package=realtest">CRAN</a>.</p>
</p>
realtest
is a framework for unit testing for realistic minimalists, where we distinguish
between expected, acceptable, current, fallback, ideal, or regressive behaviour.
It can also be used for monitoring other software projects for changes.
Now available on CRAN.2021-06-04T00:00:00+10:00https://www.gagolewski.com/_news/20210527-paper-genieclust.htmlPaper on the genieclust Python+R package2021-05-27T00:00:00+10:00<p class="ablog-post-excerpt"><p><em><a class="reference external" href="https://genieclust.gagolewski.com/">genieclust</a>: Fast and robust hierarchical clustering</em>
was accepted for publication in <em>SoftwareX</em>
(doi:<a class="reference external" href="https://dx.doi.org/10.1016/j.softx.2021.100722">10.1016/j.softx.2021.100722</a>).</p>
</p>
genieclust: Fast and robust hierarchical clustering
was accepted for publication in SoftwareX
(doi:10.1016/j.softx.2021.100722).2021-05-27T00:00:00+10:00