IVML - publications


		about \| r&d \| publications \| courses \| people \| links

G. Vonitsanos, A. Kanavos, Ph. Mylonas

A Comparative Evaluation of NLP Frameworks for Spanish Part-of-Speech Tagging Using the CoNLL-2002 Corpus

IEEE International Conference on Big Data (IEEE BigData 2025), December 8-11, 2025, Macau, China

ABSTRACT

This paper presents a comparative study of three widely adopted Natural Language Processing (NLP) frameworks—NLTK, spaCy, and Stanza—for Part-of-Speech (POS) tagging in Spanish. Using the CoNLL-2002 corpus as a benchmark, the study evaluates each tool’s performance in terms of accuracy, F1-score, execution time, and error distribution. The experimental design ensures consistency across datasets, preprocessing, and evaluation metrics, including unified XPOS→UPOS mapping for cross-tool comparability. Results reveal that NLTK’s n-gram backoff tagger achieves the highest overall accuracy (97.16%) and computational efficiency, outperforming both neural models in speed and stability. Stanza demonstrates competitive F1 performance (87.4%) but incurs greater computational cost due to its BiLSTM architecture, while spaCy shows lower accuracy yet excels in processing speed, making it suitable for real-time applications. The findings highlight key trade-offs among statistical and neural approaches, emphasizing the influence of corpus alignment, model complexity, and linguistic adaptation. Future work will explore transformer-based models, multilingual evaluation, and explainable-AI integration to enhance interpretability and resource efficiency in POS tagging.

08 December , 2025

G. Vonitsanos, A. Kanavos, Ph. Mylonas, "A Comparative Evaluation of NLP Frameworks for Spanish Part-of-Speech Tagging Using the CoNLL-2002 Corpus", IEEE International Conference on Big Data (IEEE BigData 2025), December 8-11, 2025, Macau, China

[ BibTex] [ Print] [

Back]