Research

Here you can find previous literature related to these topics. The future research publications produced in the DataLit project will be added here as well.

Prior literature

Pääkkönen, Juho & Ylikoski, Petri. (2020) Humanistic interpretation and machine learning. Synthese.

Rusanen, Anna-Mari & Ylikoski, Petri. (2017) Algoritmit, tekoäly – tieteen murros? Futura 2/2017: 15-24.

Tupasela, Aaro, Snell, Karoliina & Tarkkala, Heta. (2020) The Nordic data imaginary. Big Data & Society.

Tupasela, Aaro & Di Nucci, Ezio. (2020) Concordance as evidence in the Watson for Oncology decision-support system. AI & SOCIETY, 35(4), 811-818.

Choroszewicz, Marta & Mäihäniemi, Beata. (2020) Developing a digital welfare state: Data protection legislation and the use of automated decision-making across six EU countries. Global Perspectives 1 (1).

DataLit publications

Machine learning and reliable inferences
  • Gao, Y., Ji, S., Zhang, T., Tiwari, P., and Marttinen, P. (2022). Contextualized graph embeddings for adverse drug event detection. The European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2022). Link.
  • Cui, T., Kumar, Y., Marttinen, P., and Kaski, S. (2022). Deconfounded Representation Similarity for Comparison of Neural Networks. Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS 2022). Link. 
  • Rissanen, S. and Marttinen, P. (2021). A Critical Look at the Consistency of Causal Estimation with Deep Latent Variable Models. Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS 2021). Link.
  • Cui, T., Havulinna, A., Marttinen, P., and Kaski, S. (2021). Informative Bayesian Neural Network Priors for Weak Signals. Bayesian Analysis, 1-31. Link.
  • Sun, W, Ji, S., Cambria, E., and Marttinen, P. (2021). Multitask recalibrated aggregation network for medical code prediction. The European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2021). Link.
Privacy and synthetic data
Legal and ethical data use
Presentation and visualisation of data
Public sector as data user