IVML  
  about | r&d | publications | courses | people | links
   

G. Vonitsanos, A. Kanavos, Ph. Mylonas
A Comparative Evaluation of NLP Frameworks for Spanish Part-of-Speech Tagging Using the CoNLL-2002 Corpus
IEEE International Conference on Big Data (IEEE BigData 2025), December 8-11, 2025, Macau, China
ABSTRACT
This paper presents a comparative study of three widely adopted Natural Language Processing (NLP) frameworks—NLTK, spaCy, and Stanza—for Part-of-Speech (POS) tagging in Spanish. Using the CoNLL-2002 corpus as a benchmark, the study evaluates each tool¢s performance in terms of accuracy, F1-score, execution time, and error distribution. The experimental design ensures consistency across datasets, preprocessing, and evaluation metrics, including unified XPOS→UPOS mapping for cross-tool comparability. Results reveal that NLTK¢s n-gram backoff tagger achieves the highest overall accuracy (97.16%) and computational efficiency, outperforming both neural models in speed and stability. Stanza demonstrates competitive F1 performance (87.4%) but incurs greater computational cost due to its BiLSTM architecture, while spaCy shows lower accuracy yet excels in processing speed, making it suitable for real-time applications. The findings highlight key trade-offs among statistical and neural approaches, emphasizing the influence of corpus alignment, model complexity, and linguistic adaptation. Future work will explore transformer-based models, multilingual evaluation, and explainable-AI integration to enhance interpretability and resource efficiency in POS tagging.
08 December , 2025
G. Vonitsanos, A. Kanavos, Ph. Mylonas, "A Comparative Evaluation of NLP Frameworks for Spanish Part-of-Speech Tagging Using the CoNLL-2002 Corpus", IEEE International Conference on Big Data (IEEE BigData 2025), December 8-11, 2025, Macau, China
[ BibTex] [ Print] [ Back]

© 00 The Image, Video and Multimedia Systems Laboratory - v1.12