Research


Our research focuses on computational analysis of complex natural and social systems. There is a great demand for targeted computational techniques to extract information and insights from rich data collections based on clever combinations of human and machine intelligence. We blend elements from fields such as machine learning/AI, probabilistic programming, statistical ecology, and data science, and drive open developer communities that help to translate latest theoretical advances into accessible methods to inform modeling, experimentation, and decision-making. For a full list of publications check this page.

Selected examples

Bibliographic data science and the history of the book (c. 1500–1800)
Lahti L, Marjanen J, Roivainen H & Tolonen M.
Cataloging & Classification Quarterly 57(1) Routledge, 2019
Special issue.
10.1080/01639374.2018.1543747 | PDF

Quantifying bias and uncertainty in historical data collections with probabilistic programming
Lahti L, Mäkelä E & Tolonen M.
Proc. CEUR Workshop Proceedings on Computational Humanities Research 2020
PDF

More publications in computational humanities

An analysis of the current bibliographical data landscape in the humanities. A case for the joint bibliodata agendas of public stakeholders
Umerle T, Colavizza G, Herden E, Jagersma R, Király P, Koper B, Lahti L, Lindemann D, Łubocki J, Malínek V, Milanova A, Péter R, Rißler-Pipka N, Romanello M, Roszkowski M, Siwecka D, Tolonen M & Vimr O.
Zenodo 2022
10.5281/zenodo.6559857

Examining the early modern canon: The english short title catalogue and large-scale patterns of cultural production.
Tolonen M, Hill M, Ijaz A, Vaara V & Lahti L.
Data visualization in enlightenment literature and culture Palgrave Macmillan, Cham., 2021
Book chapter in: Data Visualization in Enlightenment Literature and Culture
10.1007/978-3-030-54913-8_3

Probabilistic analysis of early modern british book prices
Tiihonen I, Tolonen M & Lahti L.
Proc. CEUR Workshop Proceedings on Computational Humanities Research 2989, 2021
In M Ehrmann, F Karsdorp et al. (eds).
PDF

Corpus linguistics and eighteenth century collections online (ECCO)
Tolonen M, Mäkelä E, Ijaz A & Lahti L.
Research in Corpus Linguistics 9(1), 2021
10.32714/ricl.09.01.03

Quantifying bias and uncertainty in historical data collections with probabilistic programming
Lahti L, Mäkelä E & Tolonen M.
Proc. CEUR Workshop Proceedings on Computational Humanities Research 2020
PDF

Wrangling with non-standard data
Mäkelä E, Lagus K, Lahti L, Säily T, Tolonen M, Hämäläinen M, Kaislaniemi S & Nevalainen T.
Proc. Digital humanities in the nordic countries 2612, 2020
PDF

Reconstructing intellectual networks: From the ESTC’s bibliographic metadata to historical material
Hill M, Vaara V, Säily T, Lahti L & Tolonen M.
Proc. Digital humanities in the nordic countries 2364, 2019
Best paper award.
PDF

Analytical edition detection in bibliographic metadata
Ijaz A, Roivainen H & Lahti L.
Proceedings of the digital humanities (DH2019) 2019
In press.

Analytical determination of editions from bibliographic metadata
Ijaz A, Tolonen M & Lahti L.
Proceedings of the research data in the humanities conference 2019
PDF

Bibliographic data science and the history of the book (c. 1500–1800)
Lahti L, Marjanen J, Roivainen H & Tolonen M.
Cataloging & Classification Quarterly 57(1) Routledge, 2019
Special issue.
10.1080/01639374.2018.1543747 | PDF

Best practices in bibliographic data science
Lahti L, Vaara V, Marjanen J & Tolonen M.
Proceedings of the research data in the humanities conference 2019
PDF

Interdisciplinary collaboration in studying newspaper materiality
Mäkelä E, Tolonen M, Marjanen J, Kanner A, Vaara V & Lahti L.
Proc. Twin talks workshop. Digital humanities in the nordic countries 2365, 2019
PDF

A national public sphere? Analyzing the language, location, and form of newspapers in finland, 1771–1917
Marjanen J, Vaara V, Kanner A, Roivainen H, Mäkelä E, Lahti L & Tolonen M.
Journal of European Periodical Studies 4(1), 2019
Special issue: Digital Approaches Towards Serial Publications
10.21825/jeps.v4i1.10483 | PDF

Scaling up bibliographic data science
Tolonen M, Marjanen J, Roivainen H & Lahti L.
Proceedings of the digital humanities in the nordics (DHN2019) 2019
PDF

The emerging paradigm of bibliographic data science
Vaara V, Ijaz A, Tiihonen I, Hengchen S, Kanner A, Säily T & Lahti L.
Proceedings of the digital humanities (DH2019) 2019
In press.

A quantitative approach to book-printing in sweden and finland, 1640–1828
Tolonen M, Lahti L, Roivainen H & Marjanen J.
Historical Methods: A Journal of Quantitative and Interdisciplinary History 52 Routledge, 2019
10.1080/01615440.2018.1526657 | PDF

Spheres of “public” in eighteenth-century britain
Hill M, Kanner A, Marjanen J, Vaara V, Mäkelä E, Lahti L & Tolonen M.
Proceedings of the digital humanities in the nordics conference 2018

Digitaaliset ihmistieteet ja historiantutkimus
Tolonen M & Lahti L.
Menneisyyden rakentajat Gaudeamus, 2018
(in Finnish). In: Hannikainen, MO and Danielsbacka, M and Tepora, T (eds.). Menneisyyden rakentajat. Gaudeamus, Helsinki, 2018.
URL

Retrieval and analysis of eurostat open data with the eurostat package
Lahti L, Huovari J, Kainu M & Biecek P.
The R Journal 9(1), 2017
PDF

Alchemy & algorithms: Perspectives on the philosophy and history of open science
Lahti L, Silva F, Laine M, Lähteenoja V & Tolonen M.
RIO Journal 3, 2017
Review of the international conference, including electronic material (video interviews, lectures and podcasts.
10.3897/rio.3.e13593 | PDF

Printing in a periphery: A quantitative study of finnish knowledge production, 1640-1828
Tolonen M, Ilomäki N, Roivainen H & Lahti L.
Digital humanities 2016: Conference abstracts Jagiellonian University & Pedagogical University, Kraków, 2016
URL

A quantitative study of history in the english short-title catalogue (ESTC) 1470-1800
Lahti L, Ilomäki N & Tolonen M.
LIBER Quarterly 25(2), 2015
10.18352/lq.10112 | PDF

Aatehistoria ja digitaalisten aineistojen mahdollisuudet
Tolonen M & Lahti L.
Ennen & Nyt 2 2, 2015
PDF