Porlex v2 (see Gomes & Castro, 2003) is a computerized lexical database in European Portuguese containing psycholinguistic and cognitive information that is useful to select stimulus materials for experiments and/or training vocabularies. It was built on the basis of a middle-sized adult lexicon, and provides orthographic, phonological, phonetic, part-of-speech, and neighborhood information for about 27,374 words (uninflected content words and inflected function words), presented as an Excel file.
Porlex words come from the Dicionário Universal Fundamental (Texto Editora, 1998) that was selected because of its size. In the process of compiling the remaining source informations, it became clear that the final selection of the lexical entries had to be fine tuned. For that purpose, we used Porto Editora (Costa & Melo, 1997) and Cândido de Figueiredo dictionaries (1996), as well as the grammars of Cunha and Cintra (1987), Mateus, Brito, Duarte and Faria (1989), and Vilela (1995).
Lexical entries, grammatical and morphological classification, syllabication, phonetic transcription and frequency information (≈5% of the lexical entries, imported from the Fundamental Portuguese – a spoken corpus) were collected from various sources. Porlex gives additional information about the characteristics of spoken words; for example, how many schwas they include, how they are divided into syllables, where is the stressed syllable; also a pointer for ambisyllabicity, the length in number of phonemes or of syllables, and two different types of phonetic patterns, one based on a gross classification of speech sounds, another more distinctive.