Agersnap, A., Schmidt, L. W.
, Eriksen, R. S., Bønding, E. W., Kirkegaard, T. H., Borčak, L. W.
& Baunvig, K. L. (2026).
The Human Touch: Leveraging HITL for Quantitative Close Reading of Historical Corpora.
Digital Humanities in the Nordic and Baltic Countries Publications,
7(4).
https://doi.org/10.5617/dhnbpub.12981
Jeganathan, J., Campbell, M. E. J.
, Legrand, N., Allen, M. & Breakspear, M. (2025).
Aberrant Cardiac Interoception in Psychosis.
Schizophrenia Bulletin,
51(1), 208-216.
https://doi.org/10.1093/schbul/sbae078
Kristensen-McLachlan, R. D., Canavan, M., Kardos, M., Jacobsen, M. & Aarøe, L. (2025).
Are Chatbots Reliable Text Annotators? Sometimes.
PNAS Nexus,
4(4), Article pgaf069.
https://doi.org/10.1093/pnasnexus/pgaf069
Bernstorff, M., Hansen, L., Enevoldsen, K., Damgaard, J.
, Hæstrup, F., Perfalk, E., Danielsen, A. A.
& Østergaard, S. D. (2025).
Development and validation of a machine learning model for prediction of type 2 diabetes in patients with mental illness.
Acta Psychiatrica Scandinavica,
151(3), 245-258.
https://doi.org/10.1111/acps.13687
Enevoldsen, K., Jensen, K. N.
, Kostkan, J., Szabo, B.
, Kardos, M., Vad, K., Heinsen, J., Núñez, A. B., Barmina, G., Nielse, J., Larsen, R.
, Vahlstrup, P. B., Møldrup-Dalum, P., Elliot, D., Galke, L., Schneider-Kamp, P.
& Nielbo, K. L. (2025).
Dynaword: From One-shot to Continuously Developed Datasets. ArXiv.
https://arxiv.org/abs/2508.02271
Holur, P.
, Enevoldsen, K. C., Rajesh, S., Mboning, L., Georgiou, T., Bouchard, L. S., Pellegrini, M. & Roychowdhury, V. (2025).
Embed-Search-Align: DNA sequence alignment using Transformer models.
Bioinformatics,
41(3), Article btaf041.
https://doi.org/10.1093/bioinformatics/btaf041
Feldkamp, P., Lassche, A., Baunvig, K. F., Nielbo, K. & Bizzoni, Y. (2025).
Fact from Fiction: Finding Serialized Novels in Newspapers. In J. Zhao, M. Wang & Z. Liu (Eds.),
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 4: Student Research Workshop) (pp. 695-707). Association for Computational Linguistics. Advance online publication.
https://aclanthology.org/2025.acl-srw.45/
Hansen, L. B. P., Eriksen, R. S., Feldkamp, P., Lassche, A., Nielbo, K. L., Baunvig, K. L. & Bizzoni, Y. (2025).
Framing the Canon: A Computational Study of Canonicity in Danish Golden Age Paintings (1750-1870). In
Anthology of Computers and the Humanities (Vol. 3, pp. 339-356)
https://doi.org/10.63744/KTLpQIY247dD
de Soto, P.
, Pažout, A., Brughmans, T., Vahlstrup, P. B., Auir, Á., Bongers, T., Christoffersen, J. E. B., Crépy, M., Johansen, M. H., Lewis, J., Manière, L., Massa, M. R.
, Møller, L. M. H., Redon, B., Renda, G., Şahin, H.
, Sobotková, A., Spatzek, A. L., Verhagen, P. & Weissova, B. (2025).
Itiner-e: A high-resolution dataset of roads of the Roman Empire.
Scientific Data,
12(1), Article 1731.
https://doi.org/10.1038/s41597-025-06140-z
Enevoldsen, K., Chung, I., Kerboua, I.
, Kardos, M., Mathur, A., Stap, D., Gala, J., Siblini, W., Krzeminski, D., Winata, G. I., Sturua, S., Utpala, S., Ciancone, M., Schaeffer, M., Sequeira, G., Misra, D., Dhakal, S., Rystrøm, J., Solomatin, R. ... Günther, M. (2025).
MMTEB: Massive Multilingual Text Embedding Benchmark. In
13th International Conference on Learning Representations, ICLR 2025 (pp. 102004-102060). International Conference on Learning Representations.
https://openreview.net/pdf?id=zl3pfz4VCV