UoT at NADI 2023 shared task: Automatic Arabic Dialect Identification is Made Possible

Date

2023-12

Type

Conference paper

Conference title

Proceedings of The First ArabicNLP Conference 2023

Author(s)

Abdusalam Alfitory Ahmad Nwesri
Nabila S. A. Shinbir
Hassan Ali Hassan Ebrahem

Pages

620 - 624

Abstract

In this paper we present our approach towardsArabic Dialect identification which was part ofthe The Fourth Nuanced Arabic Dialect Identi-fication Shared Task (NADI 2023). We testedseveral techniques to identify Arabic dialects.We obtained the best result by fine-tuning thepre-trained MARBERTv2 model with a mod-ified training dataset. The training set wasexpanded by sorting tweets based on dialects,concatenating every two adjacent tweets, andadding them to the original dataset as newtweets. We achieved 82.87 on F1 score andwe were at the seventh position among 16 par-ticipants>

Publisher's website

View