Coordination of -mente ending adverbs in Portuguese: an integrated solution

Jorge Baptista, Lucas Nunes Vieira, Cláudio Diniz, Nuno Mamede

Research output: Chapter in Book/Report/Conference proceeding › Conference Contribution (Conference Proceeding)


Abstract

Portuguese adverbs ending in -mente constitute a large, morphologically homogeneous, but syntactically and semantically diverse lexical set. When two such adverbs are coordinated, the first loses the adverbial suffix and takes the shape of the base adjective in the feminine singular form. This raises the issue of its part-of-speech (POS) classification (adverb or adjective?) and, more importantly, of its adequate parsing, since it may be incorrectly analyzed as a modifier of a preceding noun. However, POS tagging cannot be adequately performed without some minimal prior syntactic analysis. The size of the lexicon involved (more than 7,000 adverbs) and the scarcity of instances, even in large corpora, make it ineffective to leave the task of resolving this adjective/reduced adverbial form ambiguity to the POS tagger alone. This paper proposes an integrated solution to this non-trivial empirical problem, in which a rule-based disambiguation module and a statistical POS tagger combine to produce more accurate tagging and better parsing results. The system was evaluated on a large corpus.
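
To illustrate the ambiguity described in the abstract, the following is a minimal sketch of a rule that retags a reduced adverbial form. It is not the authors' disambiguation module: the function name `retag_reduced_adverbs`, the tag labels, the tiny lexicon, and the coordinator list are all assumptions made for illustration. The rule fires only when a token tagged as an adjective is followed by a coordinator and a known -mente adverb, and when adding "-mente" to the token itself yields a known adverb.

```python
# Toy lexicon of -mente adverbs (hypothetical; the abstract mentions more than 7,000).
MENTE_ADVERBS = {"claramente", "objetivamente", "lentamente"}

# Coordinating conjunctions considered by this sketch (an assumption).
COORDINATORS = {"e", "ou", "mas", "nem"}


def retag_reduced_adverbs(tagged):
    """Retag reduced adverbial forms in a list of (token, tag) pairs.

    A token tagged "ADJ" becomes "ADV" when it is followed by a coordinator
    and a known -mente adverb, and token + "mente" is itself a known adverb.
    The input list is not modified; a new list is returned.
    """
    result = list(tagged)
    for i in range(len(result) - 2):
        token, tag = result[i]
        conjunction = result[i + 1][0].lower()
        following = result[i + 2][0].lower()
        if (
            tag == "ADJ"
            and conjunction in COORDINATORS
            and following in MENTE_ADVERBS
            and token.lower() + "mente" in MENTE_ADVERBS
        ):
            result[i] = (token, "ADV")
    return result


# "Falou clara e objetivamente": "clara" stands for "claramente".
coordinated = [("Falou", "V"), ("clara", "ADJ"), ("e", "CONJ"), ("objetivamente", "ADV")]

# "uma ideia clara e simples": "clara" is a genuine adjective modifying "ideia".
adjectival = [("uma", "DET"), ("ideia", "N"), ("clara", "ADJ"), ("e", "CONJ"), ("simples", "ADJ")]

print(retag_reduced_adverbs(coordinated))
print(retag_reduced_adverbs(adjectival))
```

In the first sentence "clara" is retagged as an adverb; in the second it stays an adjective because no -mente adverb follows the coordinator. A sketch like this ignores longer coordination chains and any statistical evidence, which the integrated approach in the paper combines with rules.
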
Original language: English
Title of host publication: PROPOR 2012
Subtitle of host publication: Computational Processing of the Portuguese Language
Editors: Helena Caseli, Aline Villavicencio, Antonio Teixeira, Fernando Perdigão
Publisher: Springer Berlin Heidelberg
Pages: 24-34
Number of pages: 11
ISBN (Electronic): 9783642288852
ISBN (Print): 9783642288845
DOIs
Publication status: Published - 2012
Event: International Conference on Computational Processing of Portuguese - Coimbra, Portugal
Duration: 17 Apr 2012 to 20 Apr 2012

Publication series

Name: Lecture Notes in Computer Science
Publisher: Springer Berlin Heidelberg
Volume: 7243
ISSN (Print): 0302-9743

Conference

Conference: International Conference on Computational Processing of Portuguese
Abbreviated title: PROPOR
Country: Portugal
Period: 17/04/12 to 20/04/12

Keywords

  • Adverb
  • Coordination
  • POS disambiguation
  • Parsing
  • Dependency
