Improving automated content analysis with news-specific word embeddings for medium-resourced languages

Name: Improving automated content analysis with news-specific word embeddings for medium-resourced languages
Start: 2019-02-07T09:30:00Z
Location: Amsterdam

Anne C Kroon, Damian Trilling, Antske Fokkens, Felicia Loecherbach, Judith Moeller, Mariken Van der Velden, Wouter van Atteveldt

Image credit: Unsplash

Abstract

In this contribution, we investigate whether it is worth the effort to train a custom model rather than relying on (limited) available pre-trained models. For the case of Dutch, few embedding models are available, and they are trained on ordinary human language from the World Wide Web. These models capture the specifics of news article data less well and are therefore likely to be less suitable to study and understand dynamics of this domain.

Date

Feb 7, 2019 9:30 AM

Event

5th International Conference on Computational Social Science IC2S2

Location

Amsterdam

word embeddings automated content analysis news small languages

Improving automated content analysis with news-specific word embeddings for medium-resourced languages

Abstract

Felicia Loecherbach

Assistant Professor Political Communication and Journalism

Related