logo comere

#Intermittent: constitution d'un corpus lié à un événement discursif controversé

logo ortolang
Open Resources and TOols for LANGuage

This page: https://hdl.handle.net/11403/comere/cmr-intermittent/cmr-intermittent-tei-v1
Back to corpus main page: https://hdl.handle.net/11403/comere/cmr-intermittent

Download the TEI file: https://hdl.handle.net/11403/comere/cmr-intermittent/cmr-intermittent-tei-v1.xml

How to cite this resource

Longhi, J., Borzic, B., Alkhouli, A.(2016). #Intermittent: constitution d'un corpus lié à un événement discursif controversé. In Chanier T. (ed) Banque de corpus CoMeRe. Ortolang.fr : Nancy. [https://hdl.handle.net/11403/comere/cmr-intermittent/cmr-intermittent-tei-v1]

Overview of the corpus

The corpus #Intermittent gathers tweets of 215 accounts identified as interested in the issue of the intermittents (contract/temporary workers from the entertainment industry). The Twitter accounts (twittos in French) have permitted the extraction of 586 239 tweets: the corpus is constituted by the 10876 tweets from these 58239 with the hashtag "intermittent". The corpus has been converted to the TEI format within the framework of the project CoMeRe (Communication médiée par les réseaux, Network mediated communication) . The CoMeRe projet aims to gather different corpus that represent the forms of communication in French on the networks (Internet, phone, etc.), all structured and informed in the same way, diffused in open acces for research purposes. The CoMeRe projet has received the support of ORTOLANG (the French equivalent of DARIAH) and of the national consortium Written-Corpus ('Corpus-écrits') , subsection of Huma-Num.

Keywords :



Collection cmr-intermittent-tei-v1 : list of files / identification numbers
Coverage: 215 user accounts / twittos ; 11 307 posts

Rationale for this corpus

Le corpus contient 10 876 tweets correspondant aux messages pistés par les 215 comptes identifiés comme étant les plus pertinents sur le sujet de la réforme du statut des intermittents du spectacle en 2014, début 2015. Les comptes Twitter (twittos) de ces personnalités ont permis l'extraction de 586 239 tweets dont 10 876 avec le hashtag #intermittent. Le corpus a ensuite été mis au format TEI de manière semblable à celle du corpus https://hdl.handle.net/11403/comere/cmr-polititweets/

Editorial procedures

The full contents of tweets have been preserved. Information about Twiter accounts (hence the source) have been added. The TEI structure of the tweet is described in tagsDecl

post correspond to one tweet

Description of the Interaction Space

CMC Environment

  • tweet: Definition of the modality Tweet. Type of messages used in Tweets.
  • Structure of interactions

    Data Collection

    Data collected : From 2011-12-28 to 2015-08-24
    location: Twitter website France

    Language of the data: français

    Types of interaction

    Extracts of Participants

    Extracts of Interactions

    Credits, Publication Statement and Rights


    Date: 2016-01-04


    uri: cmr-intermittent-tei-v1
    url: https://hdl.handle.net/11403/comere/cmr-intermittent/cmr-intermittent-tei-v1



    Rights holders of this corpus are: Julien Longhi ; Thierry Chanier

    This corpus can be freely distributed and shared subject only to attribution. The way to reference / cite the corpus is given in the titleSmt