Mourad Abbas


2020

pdf bib
Simple vs Oversampling-based Classification Methods for Fine Grained Arabic Dialect Identification in Twitter
Mohamed Lichouri | Mourad Abbas
Proceedings of the Fifth Arabic Natural Language Processing Workshop

In this paper, we present a description of our experiments on country-level Arabic dialect identification. A comparison study between a set of classifiers has been carried out. The best results were achieved using the Linear Support Vector Classification (LSVC) model by applying a Random Over Sampling (ROS) process yielding an F1-score of 18.74% in the post-evaluation phase.In the evaluation phase, our best submitted system has achieved an F1-score of 18.27%, very close to the average F1-score (18.80%) obtained for all the submitted systems.

pdf bib
SpeechTrans@SMM4H’20: Impact of Preprocessing and N-grams on Automatic Classification of Tweets That Mention Medications
Mohamed Lichouri | Mourad Abbas
Proceedings of the Fifth Social Media Mining for Health Applications Workshop & Shared Task

This paper describes our system developed for automatically classifying tweets that mention medications. We used the Decision Tree classifier for this task. We have shown that using some elementary preprocessing steps and TF-IDF n-grams led to acceptable classifier performance. Indeed, the F1-score recorded was 74.58% in the development phase and 63.70% in the test phase.

2019

pdf bib
ST MADAR 2019 Shared Task: Arabic Fine-Grained Dialect Identification
Mourad Abbas | Mohamed Lichouri | Abed Alhakim Freihat
Proceedings of the Fourth Arabic Natural Language Processing Workshop

This paper describes the solution that we propose on MADAR 2019 Arabic Fine-Grained Dialect Identification task. The proposed solution utilized a set of classifiers that we trained on character and word features. These classifiers are: Support Vector Machines (SVM), Bernoulli Naive Bayes (BNB), Multinomial Naive Bayes (MNB), Logistic Regression (LR), Stochastic Gradient Descent (SGD), Passive Aggressive(PA) and Perceptron (PC). The system achieved competitive results, with a performance of 62.87 % and 62.12 % for both development and test sets.

pdf bib
Proceedings of the 3rd International Conference on Natural Language and Speech Processing
Mourad Abbas | Abed Alhakim Freihat
Proceedings of the 3rd International Conference on Natural Language and Speech Processing

pdf bib
An Arabic Multi-Domain Spoken Language Understanding System
Mohamed Lichouri | Mourad Abbas | Rachida Djeradi | Amar Djeradi
Proceedings of the 3rd International Conference on Natural Language and Speech Processing

pdf bib
Building a Speech Corpus based on Arabic Podcasts for Language and Dialect Identification
Khaled Lounnas | Mourad Abbas | Mohamed Lichouri
Proceedings of the 3rd International Conference on Natural Language and Speech Processing

pdf bib
Proceedings of The First International Workshop on NLP Solutions for Under Resourced Languages (NSURL 2019) co-located with ICNLSP 2019 - Short Papers
Abed Alhakim Freihat | Mourad Abbas
Proceedings of The First International Workshop on NLP Solutions for Under Resourced Languages (NSURL 2019) co-located with ICNLSP 2019 - Short Papers

pdf bib
ST NSURL 2019 Shared Task: Semantic Question Similarity in Arabic
Mohamed Lichouri | Mourad Abbas | Besma Benaziz | Abed Alhakim Freihat
Proceedings of The First International Workshop on NLP Solutions for Under Resourced Languages (NSURL 2019) co-located with ICNLSP 2019 - Short Papers

2015

pdf bib
Machine Translation Experiments on PADIC: A Parallel Arabic DIalect Corpus
Karima Meftouh | Salima Harrat | Salma Jamoussi | Mourad Abbas | Kamel Smaili
Proceedings of the 29th Pacific Asia Conference on Language, Information and Computation