Building an Arabic Social Corpus for Dangerous Profile Extraction on Social Networks

Amal Rekik; Hanen Ameur; Amal Abid; Atika Mbarek; Wafa Kardamine; Salma Jamoussi; Abdelmajid Ben Hamadou

doi:10.13053/cys-22-4-3068

Authors

Amal Rekik, Multimedia InfoRmation systems and Advanced Computing Laboratory (MIRACL), Sfax
Hanen Ameur, Multimedia InfoRmation systems and Advanced Computing Laboratory (MIRACL), Sfax
Amal Abid, Multimedia InfoRmation systems and Advanced Computing Laboratory (MIRACL), Sfax
Atika Mbarek, Multimedia InfoRmation systems and Advanced Computing Laboratory (MIRACL), Sfax
Wafa Kardamine, Digital Research Center of Sfax (DRCS), Sfax
Salma Jamoussi, Multimedia InfoRmation systems and Advanced Computing Laboratory (MIRACL), Sfax
Abdelmajid Ben Hamadou, Multimedia InfoRmation systems and Advanced Computing Laboratory (MIRACL), Sfax

DOI:

https://doi.org/10.13053/cys-22-4-3068

Keywords:

Data collection, annotation guidelines, social networks, suspicious content, terrorist users, Arabic social corpus

Abstract

Social networks are now regarded as revolutionary communication tools with a tremendous impact on our lives. However, these tools can be exploited by malicious users, namely terrorists. Collecting and analyzing such profiles is a considerably challenging task for which no well-established process yet exists. In this paper, we therefore propose a new method for extracting and annotating data on suspicious social network users who threaten national security. Our method allows the construction of a rich Arabic corpus designed for detecting terrorist users on social networks. The amendment of our corpus follows a set of rules defined by a domain expert. All these steps are described in detail, and some typical examples are given. We also report statistics from the data collection and annotation stages, as well as an evaluation of the annotated features based on the agreement measured between different experts.

Published

2018-12-30

Issue

Vol. 22 No. 4 (2018): Topic Trends in Computing Research (Guest Editors: A. Aguilar-Meléndez, E. Moya-Sánchez)

Section

Articles of the Thematic Section

License

Hereby I transfer exclusively to the Journal "Computación y Sistemas", published by the Computing Research Center (CIC-IPN), the Copyright of the aforementioned paper. I also accept that these rights will not be transferred to any other publication, in any other format, language or other existing means of developing.

I certify that the paper has not been previously disclosed or simultaneously submitted to any other publication, and that it does not contain material whose publication would violate the Copyright or other proprietary rights of any person, company or institution. I certify that I have the permission from the institution or company where I work or study to publish this work.

The representative author accepts the responsibility for the publication of this paper on behalf of each and every one of the authors.

This transfer is subject to the following conditions:

The authors retain all ownership rights (such as patent rights) of this work, except for the publishing rights transferred to the CIC, through this document.
Authors retain the right to publish the work in whole or in part in any book of which they are the authors or publishers. They can also make use of this work in conferences, courses, personal web pages, and so on.
Authors may include the work as part of their thesis, for non-profit distribution only.
