Danish Foundation Models - DFM


Danish Foundation Models – Empowering the Danish Language in the Digital Age

Danish Foundation Models (DFM) is a cooperative national initiative to develop high-quality, open-access AI foundation models tailored specifically to the Danish language. In collaboration with leading academic and industrial partners, DFM is building tools to support the pre-training, fine-tuning, and evaluation of advanced Danish language models for both text and speech technologies. The initiative addresses the growing risk that smaller languages such as Danish become underrepresented in the global AI landscape, which could hamper future applications of AI in areas such as healthcare and the public sector.

DFM promotes a sustainable, inclusive, and transparent approach to AI development by emphasizing data privacy, ethical use, and open documentation. It seeks to empower public institutions, researchers, and private companies by ensuring that Danish language technologies meet sector-specific demands and are usable across a wide array of domains, from public administration and healthcare to education and commercial enterprises.

Role of Center for Humanities Computing  

The DFM project is co-led by the Center for Humanities Computing (CHC) in collaboration with the University of Southern Denmark, the University of Copenhagen, and the Alexandra Institute. A core team of infrastructure and AI specialists from these institutions contributes to the project.

CHC is responsible for:

  • Developing state-of-the-art Danish language models using ethical training regimes
  • Designing and implementing secure pipelines for pre-training and evaluation
  • Ensuring robust model documentation through tools such as model cards and datasheets
  • Contributing to benchmarks and datasets that support fine-grained evaluation of Danish NLP capabilities
  • Building tools and applications that enhance community-driven use-case development and reproducibility

Project affiliation


Funding

The project is supported by the Ministry of Digital Affairs with DKK 30,700,000.


Project Duration

2024 - 2027


Collaboration and Partnership

Collaborate with our Research Software Engineers, Data Scientists or Data Managers


Services and Support

Contact us by submitting a ticket with the CHC frontoffice