User contributions for Eugenio

A user with 162 edits. Account created on 10 January 2024.

10 January 2024

  • 23:54, 10 January 2024 diff hist 0 N File:String.png (No edit summary) current
  • 23:36, 10 January 2024 diff hist +5,393 N String:General disclaimer (Created page with "The information in the STRING webpage is addressed to the general public for the purpose of divulging the research activities and results of the STRING development team, which is part of the Spoken Language Laboratory at <span class="plainlinks">[http://www.inesc-id.pt INESC-ID Lisboa]</span>. Information presented on this website is considered public information (unless otherwise noted) and may be distributed or copied under the terms of app...") current
  • 23:34, 10 January 2024 diff hist +1 m Compound Adverbs (No edit summary)
  • 23:34, 10 January 2024 diff hist 0 Compound Adverbs (No edit summary)
  • 23:33, 10 January 2024 diff hist +62 Compound Adverbs (No edit summary)
  • 23:32, 10 January 2024 diff hist +174 N File:PortugueseCompoundAdverbs.xlsx (A list of the 300 most frequent compound (multi-word) adverbs that are common to both the Brazilian (BP) and European (EP) varieties of the Portuguese language.) current
  • 15:49, 10 January 2024 diff hist +140 N Other (Created page with "This page is really empty! Please visit us soon.... <blockquote><blockquote> file:UnderConstruction.png </blockquote></blockquote>") current
  • 15:34, 10 January 2024 diff hist +471 N Grammar (Created page with "==== Description ==== * Gramáticas locais: 3164 rules * Chunker: 344 rules * Dependências: 1613 rules ==== Local Grammars ==== * LGAbstraction * LGAdvérbios * LGCulture * LGDatum * LGElectronic * LGEvent * LGLocation * LGMeasure * LGNumber * LGOrg * LGPeople * LGProfession * LGPronouns * LGRelatives * LGSports * LGTime ==== Dependencies ==== * Auxiliary * Syntactic * BuildingLocation * BusinessRelations * Family * FixedPhrase * Lifetime * PeopleLocation * Time") current
  • 15:31, 10 January 2024 diff hist +324 N Disambiguation (Created page with "==== Description ==== Regras de descontracção: 178 Regras de desambiguação: 188 ==== Disambiguation ==== * Disamb * DisambAdjNoun * DisambAdjVerb * DisambAdv * DisambArtPron * DisambDLF * DisambExpandLast * DisambIdiomatic * DisambLast * DisambLemma * DisambPastPartNoun * DisambPrefix * DisambVerb * DisambVerbNoun") current
  • 15:27, 10 January 2024 diff hist +1,133 N Transfer (Created page with "==== Unbabel ==== [http://unbabel.com/ Unbabel] uses the Portuguese Named Entities Recognition modules of STRING for the ''anonymisation'' (or ''de-identification'') and the ''re-identification'' of named entities in the distributed translation process. Anonymisation is required for dealing with privacy issues whenever sensitive data sharing is involved, as in the [http://unbabel.com/ Unbabel] crowdsourcing translation service. ==== OOBIAN ==== Main_Pa...") current
  • 14:56, 10 January 2024 diff hist +41 Compound Adverbs (No edit summary)
  • 14:53, 10 January 2024 diff hist +174 N File:PortugueseCompoundAdverbs.pdf (A list of the 300 most frequent compound (multi-word) adverbs that are common to both the Brazilian (BP) and European (EP) varieties of the Portuguese language.) current
  • 14:52, 10 January 2024 diff hist +26 N Compound Adverbs (Created page with "This is a list of adverbs.")
  • 14:24, 10 January 2024 diff hist +29,172 N Dictionaries (Created page with "<div style="float:right;">__TOC__</div> === Description === STRING operates based on large-sized, comprehensive, highly granular lexical resources. Much emphasis is put in building them, under the conviction that the lexicon is key to many NLP tasks and applications. This page, constantly under construction, describes briefly the main resources already available and being used by STRING. === LexMan Dictionary === LexMan uses a dictionary of lemmas containing, for the m...")
  • 14:12, 10 January 2024 diff hist +13,158 N Corpora (Created page with "=== Zero Anaphora Corpus (ZAC) === <div style="float:right;">__TOC__</div> ZAC - Zero Anaphora Corpus is a corpus of Brazilian Portuguese texts built in view of the construction of an Anaphora Resolution system, which is part of the STRING system. The ZAC corpus is aimed at the resolution of the so-called zero-anaphora, that is, an anaphora relation where the anaphoric expression (or anaphor) has been zeroed. In the following, we briefly present the main linguistic asp...")
  • 13:54, 10 January 2024 diff hist 0 MARv4 (No edit summary) current
  • 13:52, 10 January 2024 diff hist +5,752 N MARv4 (Created page with "<div style="float:right;">__TOC__</div> ==== Acronym ==== '''''MARv''''' stands for '''M'''orphossyntactic '''A'''mbiguity '''R'''esol'''v'''er ==== Introduction ==== MARv2's architecture comprehends two submodules: a set of linguistically-oriented disambiguation rules module and a probabilistic disambiguation module. The linguistic-oriented is no longer used in the STRING chain because that function is now implemented by the RuDriCo module. MARv2...")
  • 13:48, 10 January 2024 diff hist +2,936 N InverseDic (Created page with "{{DISPLAYTITLE: Inverse Vocabulary of Contemporary Portuguese (InVoc-PT)}} === Presentation === <div style="float:right;">__TOC__</div> An inverse vocabulary is a particular type of vocabulary in which words are presented in alphabetical order but sorted from the last to first first character. For example, here are some words (non-contiguous in the alphabet) in the order as they are shown in the inverse vocabulary: aba, alba, alga, malga, salga, ala, bala, pala, tala, e...") current
  • 13:33, 10 January 2024 diff hist +5,984 N RuDriCo2 (Created page with "<div style="float:right;">__TOC__</div> ==== Acronym ==== '''''RuDriCo''''' stands for '''''Ru'''''le '''''Dri'''''ven '''''Co'''''nverter ==== Brief Description ==== RuDriCo2's main goal is to provide for an adjustment of the results produced by the LexMan morphological analyzer to the specific needs of each parser. In order to achieve this, it modifies the segmentation that is done by the former. For example, it might contract expressions provided by the morp...") current
  • 13:31, 10 January 2024 diff hist +2,134 N LexMan (Created page with "<div style="float:right;">__TOC__</div> ==== Acronym ==== '''''LexMan''''' stands for '''Lex'''ical '''M'''orphological '''an'''alyzer ==== Brief Description ==== LexMan is responsible for according to each token its part-of-speech (POS) and any other relevant morphosyntactic feature (gender, number, tense, mood, case, degree, etc.), using [http://en.wikipedia.org/wiki/Finite_state_transducer finite state transducers]. LexMan uses very rich, highly granular ta...") current
  • 13:24, 10 January 2024 diff hist +1,040 N Contact (Created page with "Any comments, suggestions, doubts or ideas, please contact us! We would like to hear from you! We are located in Lisbon, [http://www.l2f.inesc-id.pt/wiki/index.php/Location near the Saldanha area].<br> A general path finder is [http://www.transporlis.sapo.pt/index.cfm here]. Special options can be found [http://www.l2f.inesc-id.pt/wiki/index.php/Contacts_and_Directions here]. ==== Contacts ==== {| width="400" cellspacing="2" cellpadding="2" |- ! width="16%" valign="TO...") current
  • 13:22, 10 January 2024 diff hist +20,426 N XIP (Created page with "<div style="float:right;">__TOC__</div> ==== Acronym ==== '''''XIP''''' stands for '''''X'''''EROX '''''I'''''ncremental '''''P'''''arsing ==== Introduction ==== XIP is a <span class="plainlinks">[http://www.xrce.xerox.com/Research-Development/Document-Content-Laboratory/Parsing-Semantics/Robust-Parsing XEROX]</span> parser, based on finite-state technology and able to perform several tasks, namely: * adding lexical, syntactic and semantic information; * applying...") current
  • 13:15, 10 January 2024 diff hist +24,448 N Publications (Created page with "<div style="float:right;">__TOC__</div> ====in 2016==== '''[73]''' Francisco Dias [http://www.inesc-id.pt/ficheiros/publicacoes/10593.pdf Multilingual Automated Text Anonymization]. MSc thesis, Instituto Superior Técnico, Universidade Técnica de Lisboa, Lisboa, Portugal, June 2016 (bibtex) '''[72]''' Joana Pinto [http://www.inesc-id.pt/ficheiros/publicacoes/10639.pdf Fine-grained POS-tagging: Full disambiguation of verbal morpho-synta...") current
  • 13:11, 10 January 2024 diff hist +181 N Template:Coordinator (Created page with "{| width='100%' cellspacing='0' | '''{{{name}}}''' |- | style='vertical-align: top; font-size: 12px; text-align: justify;' | [[Image:{{{photo}}}|right|top|border|130px]] {{{cv}}} |}") current
  • 13:10, 10 January 2024 diff hist +201 N Template:Teammember (Created page with "{| width='100%' cellspacing='0' | '''{{{name}}}''' |- | style='vertical-align: top; font-size: 12px; text-align: justify;' | [[Image:{{{photo}}}|right|top|border|130px]] {{{work}}} <br />[{{{pub}}}] |}") current
  • 13:07, 10 January 2024 diff hist +40,630 N Team (Created page with "== Coordination == {| width="100%" valign="top" cellpadding="10px" |style="vertical-align: top; text-align: left; width: 35%;" | {{Coordinator |name=[http://www.l2f.inesc-id.pt/wiki/index.php/Nuno_Mamede Nuno Mamede] (Computer Science Coordination) |photo=Nuno.png |cv=Nuno J. Mamede received his graduation, MSc and PhD degrees in Electrical and Computer Engineering by the [http://www.ist.utl.pt Instituto Superior Técnico], Lisbon, in 1981, 1985 and 1992, respectively....") current
  • 13:03, 10 January 2024 diff hist +8,709 N Architecture (Created page with "<div style="float:right;">__TOC__</div> '''STRING''' is a '''St'''atistical and '''R'''ule-Based '''N'''atural Lan'''g'''uage Processing Chain for Portuguese developed at <span class="plainlinks">[https://www.hlt.inesc-id.pt/wiki/ HLT]</span> and it consists of several modules, which are represented in the next figure: 800px ==== Tokenizer ==== The first module is responsible for text segmentation, and it divides the text into tokens. Besides...") current
  • 13:00, 10 January 2024 diff hist 0 m Main Page (No edit summary) current
  • 11:58, 10 January 2024 diff hist +1,101 Main Page (No edit summary)
  • 11:50, 10 January 2024 diff hist +442 N MediaWiki:Sidebar (Created page with "* Navigation ** Main_Page|STRING ** architecture|architecture ** team|team ** publications|publications ** transfer|technology transfer ** https://string.hlt.inesc-id.pt/demo|demo ** contact|contact * Modules ** LexMan|LexMan ** RuDriCo2|RuDriCo2 ** MARv4|MARv4 ** XIP|XIP ** other|other * Lexical Resources ** dictionaries|dictionaries ** InverseDic|inverse dictionary ** disambiguation|disambiguation ** grammar|grammar ** corpora|corpora") current