Latest News

EUROPHRAS 2019

June, 2019

We are going to present two papers at EUROPHRAS 2019, in Málaga (September, 2019).

ACL 2019

May, 2019

We are going to present four papers at ACL 2019, in Florence (July-August, 2019): one at the main conference and three in co-located workshops.

eLex 2019

May, 2019

We are going to present a paper at eLex 2019 (Electronic Lexicography in the 21st Century: Smart Lexicography), in Sintra (October, 2019).

TIAD 2019

May, 2019

We are presenting a paper at the shared task Translation Inference Across Dictionaries (TIAD 2019), in Leipzig.

Automatic Extraction of Multilingual Collocation Equivalents

This project aims to automatically extract large numbers of multilingual collocation equivalents in Portuguese, Spanish, and English. These multilingual collocations are useful both for improving second language learning and for enriching machine translation systems.

The collocations are extracted from parallel, comparable, and monolingual corpora, combining dependency parsing and statistical association measures with distributional semantics techniques.
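As a rough illustration of the statistical association step, the sketch below scores head-dependent pairs with pointwise mutual information (PMI). The choice of PMI, the function name `pmi_scores`, and the toy verb-object pairs are assumptions made for this example; the project description does not state which association measures are used, and in practice the pairs would come from a dependency parser rather than a hand-written list.

```python
import math
from collections import Counter


def pmi_scores(pairs):
    """Score (head, dependent) pairs by pointwise mutual information.

    `pairs` is a list of (head, dependent) tuples, e.g. verb-object
    pairs taken from dependency-parsed text (hypothetical input format).
    """
    pair_counts = Counter(pairs)
    head_counts = Counter(head for head, _ in pairs)
    dep_counts = Counter(dep for _, dep in pairs)
    total = len(pairs)

    scores = {}
    for (head, dep), n in pair_counts.items():
        p_pair = n / total
        p_head = head_counts[head] / total
        p_dep = dep_counts[dep] / total
        # PMI compares the observed co-occurrence probability with the
        # probability expected if head and dependent were independent.
        scores[(head, dep)] = math.log2(p_pair / (p_head * p_dep))
    return scores


# Toy verb-object pairs, invented for illustration only.
pairs = [
    ("make", "decision"), ("make", "decision"), ("make", "cake"),
    ("take", "decision"), ("take", "walk"), ("take", "walk"),
]
scores = pmi_scores(pairs)

# Print candidates from strongest to weakest association.
for pair, score in sorted(scores.items(), key=lambda kv: -kv[1]):
    print(pair, round(score, 3))
```

Pairs that co-occur more often than chance would predict get a positive score and are candidate collocations. PMI is known to favour rare pairs, so a real pipeline would typically add a frequency threshold or use a different measure.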

The project is being developed by members of the LyS (Language and Information Society) Group at the Faculty of Philology (UdC), and runs from September 2017 to June 2019.

This project is supported by a 2017 Leonardo Grant for Researchers and Cultural Creators, BBVA Foundation. The Foundation takes no responsibility for the opinions, statements and visual content of the project, which are entirely the responsibility of its authors.

We gratefully acknowledge the support of NVIDIA Corporation with the donation of the Titan Xp GPU used for this research.

Work produced with the support of a 2017 Leonardo Grant for Researchers and Cultural Creators, BBVA Foundation.
