Open Source
Telenor Nordics Customer Service self-help corpus
A multilingual customer service self-help corpus has been released, consisting of 1,122 validated documents in Finnish, Danish, Norwegian, and Swedish, with a total of 274,599 words. This dataset addresses the scarcity of domain-specific resources for Nordic languages, particularly in customer service, and is designed to enhance retrieval-augmented generation, cross-lingual transfer learning, and agent-based service architectures. The corpus is publicly available, promoting reproducible research in Nordic NLP and information retrieval.
customer-serviceself-helpmultilingual