About¶

Prof. Marek Gagolewski

(pronounced like: Maa’rek (Mark) Gong-o-leaf-ski)

Associate Professor in Data Science
Academic Director for Data Science Studies, etc.

🏛 Department of Computational Statistics and Data Analysis, Faculty of Mathematics and Information Science, Warsaw University of Technology, Poland
📪 ul. Koszykowa 75, 00-662 Warsaw, Poland (Math Building-MiNI, room 550; see my timetable)
and
🏛 Systems Research Institute, Polish Academy of Sciences
📪 ul. Newelska 6, 01-447 Warsaw, Poland

🆔 ORCID: 0000-0003-0637-6028

✉️ Emails (pick one – and only one): ○ marek▮gagolewski▯com (main) ○ marek▯gagolewski▮pw▯edu▯pl (academic)

📚 Open-access textbooks: ○ Deep R Programming ○ Minimalist Data Wrangling with Python
🖥️ Open-source software: ○ stringi ○ lumbermark ○ deadwood ○ genieclust ○ quitefastmst ○ stringx ○ realtest ○ clustering-benchmarks
🧐 See also: ○ GitHub ○ StackOverflow ○ G**gle Scholar ○ Vitæ ○ MADAM Seminar

Highlights¶

🔍 Researcher in data science (with emphasis on mathematical modelling of complex phenomena and developing usable data analysis/machine learning software)
- Area editor in Fuzzy Sets and Systems (aggregation functions and data science)
- Research interests: computational and applied statistics; data science algorithms and data structures; machine learning; artificial intelligence; data mining on networks/graphs; data fusion, aggregation, and clustering; mathematical modelling in informetrics, bibliometrics, psychometrics, sports analytics, economics, social sciences, and science of science; systems research
- Author/editor of ~100 publications, including journal papers in outlets such as Proceedings of the National Academy of Sciences (PNAS), Journal of Statistical Software, R Journal, Journal of Classification, Information Fusion, International Journal of Forecasting, Statistical Modelling, Physica A: Statistical Mechanics and Its Applications, Information Sciences, Knowledge-Based Systems, IEEE Transactions on Fuzzy Systems, and Journal of Informetrics
💻 Open-source data analysis/machine learning/statistical software developer
- Author and maintainer of the fast&robust Genie, Lumbermark, and Deadwood clustering and anomaly detection algorithms; see Python and R packages genieclust, lumbermark, deadwood
- Author and maintainer of stringi – one of the most often downloaded R packages (text/natural language processing)
🎓 Data science, machine learning, and statistical computing lecturer
- Author of the open-access textbooks Deep R Programming and Minimalist Data Wrangling with Python
- Current: Warsaw University of Technology, Poland
- Past: Deakin University (Melbourne, Australia) and Data Science Retreat (Berlin, Germany)