The MultiplEYE meeting in Bucharest brought together the project teams to review annotation progress and coordinate upcoming work. The sessions covered sentence and word tokenization, POS tagging, lemmatization, syllable counting, named entity recognition, and coreference resolution. Particular emphasis was placed on sharing experiences with language-specific annotation challenges.
For Albanian POS annotation, the team discussed and resolved issues related to correct token segmentation. Attention was given to cases where two words were incorrectly merged into a single token, such as merged dative and accusative short forms. Agreed solutions were reached through joint discussion with the support team.
Named Entity Recognition was also discussed in detail, with a focus on the consistent application of entity labels across languages. Specific attention was given to the distinction between PERSON, ORG, GEOPOL, LOC, and TITLE labels.
Progress tables were reviewed collaboratively to assess the current status of each language. Overall, the Bucharest meeting strengthened team coordination and provided shared guidance for improving annotation consistency and quality across the MultiplEYE project.
Report from the WG1 meeting in Bucharest 2-5 February 2026
Posted
