20.10. What makes the meaning of two texts similar? Linguistic phenomena behind cross-level semantic similarity in Serbian

Wednesday, 05.10.2022

Maja Miličević Petrović; University of Bologna

Establishing whether two text items have similar meanings is a central task in natural language processing. In linguistics, by contrast, semantic similarity is less often approached as a single concept, although it is widely studied in relation to specific phenomena such as synonymy or diathesis alternations. In this talk, I will discuss the potential usefulness of tackling semantic similarity more broadly, in terms of linguistic analysis and with a view to contributing to natural language processing. I will present a taxonomy of semantic similarity types and indicators based on previously proposed classifications of paraphrase (Vila Rigat 2012, Vila et al. 2014, Milićević 2007, Mel’čuk 2012) and illustrate it with examples from two sets of Serbian data, newswire texts and software code comments (Miličević Petrović et al. 2022). I will outline a doubly cross-level perspective: the datasets contain pairs of texts of different lengths (phrase-sentence and sentence-paragraph), and similarity indicators from different levels of linguistic structure will be considered.



Time: 20 October 2022, 5:15 p.m.; Location: Merangasse 70, 1st floor, room 33.1.224




