|
#Intermittent: constitution d'un corpus lié à un événement discursif controversé
|
Open Resources and TOols for LANGuage |
This page: https://hdl.handle.net/11403/comere/cmr-intermittent/cmr-intermittent-tei-v1
Back to corpus main page: https://hdl.handle.net/11403/comere/cmr-intermittent
Download the TEI file: https://hdl.handle.net/11403/comere/cmr-intermittent/cmr-intermittent-tei-v1.xml
How to cite this resource
Longhi, J., Borzic, B., Alkhouli, A.(2016). #Intermittent: constitution d'un corpus
lié à un événement discursif controversé. In Chanier T. (ed) Banque de corpus CoMeRe.
Ortolang.fr : Nancy.
[https://hdl.handle.net/11403/comere/cmr-intermittent/cmr-intermittent-tei-v1]
Overview of the corpus
The corpus #Intermittent gathers tweets of 215 accounts identified
as interested in the issue of the intermittents (contract/temporary workers from the entertainment industry). The Twitter accounts (twittos in French) have
permitted the extraction of 586 239 tweets: the corpus is constituted by the 10876 tweets
from these 58239 with the hashtag "intermittent". The corpus has been converted to the TEI
format within the framework of the project CoMeRe (Communication
médiée par les réseaux, Network mediated communication)
. The CoMeRe projet aims to gather different
corpus that represent the forms of communication in French on the networks (Internet,
phone, etc.), all structured and informed in the same way, diffused in open acces for
research purposes. The CoMeRe projet has received the support of ORTOLANG (the French
equivalent of DARIAH) and of the national consortium Written-Corpus ('Corpus-écrits')
, subsection of Huma-Num.Keywords :
References
-
Julien Longhi (2006). « De intermittent du spectacle à intermittent : de la
représentation à la nomination d’un objet du discours », Corela, 4-2.
http://corela.revues.org/457
-
Julien Longhi (2008). « Sens communs et dynamiques sémantiques : l'objet discursif
INTERMITTENT. », Langages n° 170 p. 109-124
Composition
Collection cmr-intermittent-tei-v1 : list of files / identification numbers
https://hdl.handle.net/11403/comere/cmr-intermittent/cmr-intermittent-tei-v1https://hdl.handle.net/11403/comere/cmr-intermittent/cmr-intermittents-tei-v1-manuel.pdf
Coverage: 215 user accounts / twittos ; 11 307 posts
Rationale for this corpus
Le corpus contient 10 876 tweets correspondant aux messages pistés par
les 215 comptes identifiés comme étant les plus pertinents sur le sujet de la réforme du
statut des intermittents du spectacle en 2014, début 2015. Les comptes Twitter (twittos) de ces personnalités ont permis
l'extraction de 586 239 tweets dont 10 876 avec le hashtag #intermittent.
Le corpus a ensuite été mis au format TEI de manière semblable à celle du corpus https://hdl.handle.net/11403/comere/cmr-polititweets/
Editorial procedures
The full contents of tweets have been preserved. Information about Twiter accounts
(hence the source) have been added. The TEI structure of the tweet is described in
tagsDecl
post correspond to one tweet
Description of the Interaction Space
CMC Environment
tweet: Definition of the modality Tweet. Type of messages used in Tweets.
Structure of interactions
- text: each text correspond to the set of tweets coming from the same Twitter
account
- post: one post corresponds to one tweet.
- xml:idID of the posting.
-
whenis date of message on Twitter.
-
whoID of the twitter account, see listPerson .
-
typetype of message cf. taxononomy. Not displayed here. Default value for
all tweets
- p: This element appears inside the
- distinct: This element appears inside
- twitter-hashtag. Then the element contains ident with
#, and ref with the URL of discussion topic
- twitter-retweet. Then the element contains ident with
RT
- twitter-via. Then the element contains ident with
via
- addressingTerm: Addressing terms address an utterance to a particular
interlocutor / twitto or refers to a twitto. It includes :
- addressMarker with @
- addressee refers to a Twitter account
- trailer: This element appears inside
Data Collection
Data collected : From 2011-12-28 to 2015-08-24
location:
Twitter website
France
Language of the data:
français
Types of interaction
- channel: mode: w,
Message sent through a Twitter
account
- constitution: type: single,
Selected through automatic processing. See projectDesc for more
information
- derivation: type: original,
- domain: type: public,
domain of a message: social
- factuality: type: fact,
- interaction: type: complete,
active: many,
- preparedness: type: spontaneous,
- purpose: social discussion
Extracts of Participants
-
Person ID= cmr-intermittents-p7950382
persName:
Marin Favre
MarinFavre
-
Person ID= cmr-intermittents-p83999882
persName:
Christophe DEMAY
WWWCDORG
-
Person ID= cmr-intermittents-p1379934547
persName:
suppermittent
suppermittent
-
Person ID= cmr-intermittents-p85816090
persName:
Nicolas Séné
NicoSene
Extracts of Interactions
-
POST:
xml:id:
a635815085982658560
|
who:
#cmr-intermittents-p7950382
|
when:
2015-08-24T16:05:04.0
|
xml:lang:
fra
|
p: La question des #intermittents
#du
#spectacle est-elle définitivement réglée ? | http://t.co/7cWfsv75qK
trailer:
-
medium: Twitter Web Client
-
POST:
xml:id:
a633593462244425728
|
who:
#cmr-intermittents-p83999882
|
when:
2015-08-18T12:57:07.0
|
xml:lang:
fra
|
status:
draft
p:
RT
@L_A_Culture : "Les #intermittents sont toujours dans la panade" via
@nrpoitiers | http://t.co/qRixKbAvxJ
trailer:
-
medium: TweetDeck
-
retweetcount: 1
-
isRetweet: true
-
retweetedstatus_id: 633546447649132544
-
POST:
xml:id:
a633556144548671489
|
who:
#cmr-intermittents-p1379934547
|
when:
2015-08-18T10:28:50.0
|
xml:lang:
fra
|
status:
draft
p:
RT
@SaadaDahmani :
@suppermittent
#appli
gratuite pour les #intermittents du spectacle http://t.co/gjkj0AQWJb
trailer:
-
medium: Twitter for Android
-
retweetcount: 1
-
isRetweet: true
-
retweetedstatus_id: 632893019830755328
-
POST:
xml:id:
a630318028153057280
|
who:
#cmr-intermittents-p85816090
|
when:
2015-08-09T12:01:43.0
|
xml:lang:
fra
|
status:
draft
p:
#Rebsamen quitte le gouvernement après avoir semer la misère parmi
les #intermittents et les chômeurs #chomage
https://t.co/8In9Nkon9b
trailer:
-
medium: Twitter Web Client
-
POST:
xml:id:
a629267055775219712
|
who:
#cmr-intermittents-p83999882
|
when:
2015-08-06T14:25:31.0
|
xml:lang:
fra
|
status:
draft
p:
RT
@suppermittent : en attendant que
@pole_emploi le fasse : suppermittent , outil libre de
simulation suivi des droits #intermittents. ht…
trailer:
-
medium: Twitter Web Client
-
retweetcount: 4
-
isRetweet: true
-
retweetedstatus_id: 627035988016168962
-
POST:
xml:id:
a629230006766542848
|
who:
#cmr-intermittents-p1379934547
|
when:
2015-08-06T11:58:18.0
|
xml:lang:
fra
|
status:
draft
p:
@RadioBiCarbonat Bjr, à découvrir/RT : un outil
pratique non-marchand pr artistes et tech #intermittents >> https://t.co/zSx7laXnG8 Merci
trailer:
-
medium: Twitter Web Client
-
inReplyToUserId: 1029117938
-
inReplyToScreenName: RadioBiCarbonat
-
POST:
xml:id:
a629229851501834240
|
who:
#cmr-intermittents-p1379934547
|
when:
2015-08-06T11:57:41.0
|
xml:lang:
fra
|
status:
draft
p:
@LeTheatreSN Bjr, à découvrir/RT : un outil pratique
non-marchand pr artistes et tech #intermittents >> https://t.co/zSx7laXnG8… . Merci
trailer:
-
medium: Twitter Web Client
-
inReplyToUserId: 2233216339
-
inReplyToScreenName: LeTheatreSN
-
POST:
xml:id:
a629229322834944000
|
who:
#cmr-intermittents-p1379934547
|
when:
2015-08-06T11:55:35.0
|
xml:lang:
fra
|
status:
draft
p:
@HComineas Bjr, à découvrir/RT : un outil pratique
non-marchand pr artistes et tech #intermittents >> https://t.co/zSx7laXnG8. Merci
trailer:
-
medium: Twitter Web Client
-
inReplyToUserId: 1978255460
-
inReplyToScreenName: HComineas
-
POST:
xml:id:
a628924920790151169
|
who:
#cmr-intermittents-p1379934547
|
when:
2015-08-05T15:46:00.0
|
xml:lang:
fra
|
status:
draft
p:
@poleemploi_PDL un outil pratique et gratuit pour les
artistes et tech #intermittents >> https://t.co/zSx7laXnG8
trailer:
-
medium: Twitter Web Client
-
inReplyToUserId: 2900535185
-
inReplyToScreenName: poleemploi_PDL
Credits, Publication Statement and Rights
Publisher(s)
Date: 2016-01-04
Identifier(s)
uri: cmr-intermittent-tei-v1
url: https://hdl.handle.net/11403/comere/cmr-intermittent/cmr-intermittent-tei-v1
Licence
http://creativecommons.org/licenses/by/4.0/
Rights holders of this corpus are: Julien Longhi ; Thierry
Chanier
This corpus can be freely distributed and shared subject only to
attribution. The way to reference / cite the corpus is given in the
titleSmt
Credits
-
Sponsor(s): Consortium Corpus-écrits La création de l’Infrastructure de Recherche
CORPUS (Coopération des Opérateurs de Recherche Pour un Usage des Sources numériques)
a ouvert la possibilité de constituer un consortium linguistique spécialement dédié
aux Corpus écrits. Ce consortium est géré par l'Institut de Linguistique
Françaiseet fait partie de la TGIR (très grande infrastructure de
recherche) Huma-Num (
FRANCE
-
Sponsor(s): Université de Cergy-Pontoise
-
Sponsor(s): ORTOLANG ORTOLANG a accordé un financement de 5000 euros pour la
finalisation de ce corpus
-
Author(s): Julien, Longhi ; Thierry, Chanier ;
- compiler: depositor:
Julien, Longhi ;
- editor:
Thierry, Chanier ;
- data_inputter:
Boris, Borzic ;
- data_inputter:
Abdulhafiz, Alkhouli ;