About¶
Prof. Marek Gagolewski
(pronounced like: Maa’rek (Mark) Gong-o-leaf-ski)
Associate Professor in Data Science
Academic Director for Data Science and Engineering Studies (Dean’s Proxy)
🏛 Department of Computational Statistics and Data Analysis, Faculty of Mathematics and Information Science, Warsaw University of Technology, Poland
📪 ul. Koszykowa 75, 00-662 Warsaw, Poland (Math Building-MiNI, room 550; see my timetable)
and
🏛 Systems Research Institute, Polish Academy of Sciences
📪 ul. Newelska 6, 01-447 Warsaw, Poland
🆔 ORCID: 0000-0003-0637-6028
✉️ Emails (pick one – and only one): ○ marek▮gagolewski▯com (main) ○ marek▯gagolewski▮pw▯edu▯pl (academic)
📚 Open-access textbooks:
○ Deep R Programming
○ Minimalist Data Wrangling with Python
🖥️ Open-source software:
○ stringi
○ genieclust
○ clustering-benchmarks
○ stringx
○ realtest
🧐 See also:
○ GitHub
○ StackOverflow
○ G**gle Scholar
○ Vitæ
○ MADAM Seminar
Highlights¶
🔍 Researcher in data science (with emphasis on mathematical modelling of complex phenomena and developing usable, general-purpose algorithms and data analysis software)
Area editor in Fuzzy Sets and Systems (aggregation functions and data science)
Research interests: computational and applied statistics; machine learning; data mining on networks/graphs; data fusion, aggregation, and clustering; mathematical modelling in informetrics, bibliometrics, psychometrics, sports analytics, economics, social sciences, and science of science; systems research
Author/editor of ~100 publications, including journal papers in outlets such as Proceedings of the National Academy of Sciences (PNAS), Journal of Statistical Software, R Journal, Journal of Classification, Information Fusion, International Journal of Forecasting, Statistical Modelling, Physica A: Statistical Mechanics and Its Applications, Information Sciences, Knowledge-Based Systems, IEEE Transactions on Fuzzy Systems, and Journal of Informetrics
💻 Free (libre) and open-source data analysis software developer
Author and maintainer of the fast&robust Genie hierarchical clustering algorithm; see Python and R package
genieclust
Author and maintainer of
stringi
– one of the most often downloaded R packages (text/natural language processing)
🎓 Data science, machine learning, and statistical computing lecturer
Author of the open-access textbooks Deep R Programming and Minimalist Data Wrangling with Python
Current: Warsaw University of Technology, Poland
Past: Deakin University (Melbourne) and Data Science Retreat (Berlin)