
Porttinari - a Large Multi-genre Treebank for Brazilian Portuguese
Author(s) -
Thiago Alexandre Salgueiro Pardo,
Magali Sanches Duran,
Lucelene Lopes,
Ariani Di Felippo,
Norton Trevisan Roman,
Maria das Graças Volpe Nunes
Publication year - 2021
Language(s) - English
Resource type - Conference proceedings
DOI - 10.5753/stil.2021.17778
Subject(s) - treebank, computer science, portuguese, parsing, natural language processing, syntax, artificial intelligence, brazilian portuguese, linguistics, annotation, philosophy
This paper presents the Porttinari project, which aims to build a large multi-genre treebank for Brazilian Portuguese. We address relevant research questions in its construction and annotation and report the work done so far. The treebank follows the “Universal Dependencies” international model, widely adopted in the area, and is intended to serve as the basis for developing state-of-the-art tagging and parsing systems for Portuguese, as well as for linguistic studies of the morphosyntax and syntax of the language.