To read this content please select one of the options below:

Topological and topical characterisation of Twitter user communities

Guillaume Gadek (Normandie Université, LITIS, INSA de Rouen Normandie, Rouen, France) (Airbus, Élancourt, France)
Alexandre Pauchet (Normandie Université, LITIS, INSA de Rouen Normandie, Rouen, France)
Nicolas Malandain (Normandie Université, LITIS, INSA de Rouen Normandie, Rouen, France)
Laurent Vercouter (Normandie Université, LITIS, INSA de Rouen Normandie, Rouen, France)
Khaled Khelif (Airbus, Élancourt, France)
Stéphan Brunessaux (Airbus, Élancourt, France)
Bruno Grilhères (Airbus, Élancourt, France)

Data Technologies and Applications

ISSN: 2514-9288

Article publication date: 17 August 2018

Issue publication date: 4 October 2018

219

Abstract

Purpose

Most of the existing literature on online social networks (OSNs) either focuses on community detection in graphs without considering the topic of the messages exchanged, or concentrates exclusively on the messages without taking into account the social links. The purpose of this paper is to characterise the semantic cohesion of such groups through the introduction of new measures.

Design/methodology/approach

A theoretical model for social links and salient topics on Twitter is proposed. Also, measures to evaluate the topical cohesiveness of a group are introduced. Inspired from precision and recall, the proposed measures, called expertise and representativeness, assess how a set of groups match the topic distribution. An adapted measure is also introduced when a topic similarity can be computed. Finally, a topic relevance measure is defined, similar to tf.idf (term-frequency, inverse document frequency).

Findings

The measures yield interesting results, notably on a large tweet corpus: the metrics accurately describe the topics discussed in the tweets and enable to identify topic-focused groups. Combined with topological measures, they provide a global and concise view of the detected groups.

Originality/value

Many algorithms, applied on OSN, detect communities which often lack of meaning and internal semantic cohesion. This paper is among the first to quantify this aspect, and more precisely the topical cohesion and topical relevance of a group. Moreover, the proposed indicators can be exploited for social media monitoring, to investigate the impact of a group of people: for instance, they could be used for journalism, marketing and security purposes.

Keywords

Acknowledgements

Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the owner/author(s).

Citation

Gadek, G., Pauchet, A., Malandain, N., Vercouter, L., Khelif, K., Brunessaux, S. and Grilhères, B. (2018), "Topological and topical characterisation of Twitter user communities", Data Technologies and Applications, Vol. 52 No. 4, pp. 482-501. https://doi.org/10.1108/DTA-01-2018-0006

Publisher

:

Emerald Publishing Limited

Copyright © 2018, Emerald Publishing Limited

Related articles