<?xml 
version="1.0" encoding="utf-8"?><?xml-stylesheet title="XSL formatting" type="text/xsl" href="https://talne.eu/spip/spip.php?page=backend.xslt" ?>
<rss version="2.0" 
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:atom="http://www.w3.org/2005/Atom"
>

<channel xml:lang="fr">
	<title>MC2 2018 Lab</title>
	<link>https://clef2018.clef-initiative.eu/mc2/</link>
	<description>MC2 CLEF Lab is centered on mining the social media sphere surrounding cultural events such as festivals and movies, It provides access for registered participants to the microbolg collection of the GAFES project funded by the French National Research Agency and lead by the University of Avignon.</description>
	<language>fr</language>
	<generator>SPIP - www.spip.net</generator>
	<atom:link href="https://talne.eu/spip/spip.php?id_rubrique=12&amp;page=backend" rel="self" type="application/rss+xml" />




<item xml:lang="en">
		<title>More about use case, data and evaluation process</title>
		<link>https://talne.eu/spip/spip.php?page=article&amp;id_article=19</link>
		<guid isPermaLink="true">https://talne.eu/spip/spip.php?page=article&amp;id_article=19</guid>
		<dc:date>2018-02-09T09:39:44Z</dc:date>
		<dc:format>text/html</dc:format>
		<dc:language>en</dc:language>
		<dc:creator>Chiraz Latiri, Julio Gonzalo, Malek Hajjem</dc:creator>



		<description>
&lt;p&gt;Detailed description &lt;br class='autobr' /&gt;
use case &lt;br class='autobr' /&gt;
Given, a selected of festivals name from popular festivals on FlickR English and French language, participants have to search for the most argumentative tweets in a collection covering 18 months of news about festivals in different languages. The identified tweets have to be a summary of ranked tweets according to their probability of being argumentative tweets. This use case was proposed to help festival organiser treating such set of tweets on priority. (&#8230;)&lt;/p&gt;


-
&lt;a href="https://talne.eu/spip/spip.php?page=rubrique&amp;id_rubrique=12" rel="directory"&gt;2- Mining opinion argumentation&lt;/a&gt;


		</description>


 <content:encoded>&lt;div class='rss_texte'&gt;&lt;h2 class=&#034;spip&#034;&gt;Detailed description&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;use case&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Given, a selected of festivals name from popular festivals on FlickR &lt;a href=&#034;https://mc2.talne.eu/~t17malek/mc2_2018_t2/opinion/en/arg/data_t2_sample/English-topics.csv&#034; class=&#034;spip_out&#034; rel=&#034;external&#034;&gt;English&lt;/a&gt; and &lt;a href=&#034;https://mc2.talne.eu/~t17malek/mc2_2018_t2/opinion/en/arg/data_t2_sample/French_topics.csv&#034; class=&#034;spip_out&#034; rel=&#034;external&#034;&gt;French&lt;/a&gt; language, participants have to search for the most argumentative tweets in a collection covering 18 months of news about festivals in different languages. The identified tweets have to be a summary of ranked tweets according to their probability of being argumentative tweets. This use case was proposed to help festival organiser treating such set of tweets on priority. That is why the more the summary of ranked tweets is variant the in term of argumentation the more the run is useful.&lt;br class='autobr' /&gt;
For each language English and French (English and French), a monolingual scenario is expected : Given a festival name from topics file, participants have to search, from the microblog collection, the set of the most argumentative tweets in the same query language.&lt;br class='autobr' /&gt;
Samples of argumentative Tweets are provided here: &lt;a href=&#034;https://mc2.talne.eu/~t17malek/mc2_2018_t2/opinion/en/arg/data_t2_sample/English_sample_with_5_tweets.csv&#034; class=&#034;spip_out&#034; rel=&#034;external&#034;&gt;English_Sample&lt;/a&gt;, &lt;a href=&#034;https://mc2.talne.eu/~t17malek/mc2_2018_t2/opinion/en/arg/data_t2_sample/French_sample_5_tweets.csv&#034; class=&#034;spip_out&#034; rel=&#034;external&#034;&gt;French_Sample&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Topics&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href=&#034;https://mc2.talne.eu/~t17malek/mc2_2018_t2/opinion/en/arg/data_t2_sample/English-topics.csv&#034; class=&#034;spip_out&#034; rel=&#034;external&#034;&gt;English&lt;/a&gt; and &lt;a href=&#034;https://mc2.talne.eu/~t17malek/mc2_2018_t2/opinion/en/arg/data_t2_sample/French_topics.csv&#034; class=&#034;spip_out&#034; rel=&#034;external&#034;&gt;French&lt;/a&gt; contain respectively 12 and 4 festival name. They represent a set of some popular festivals on FlickR for which we have pictures. Topics were carefully selected by organizer to ensure that selected topics have enough related argumentative tweets in our corpus. Such manual selection was conduct to to ensure a possible evaluation.&lt;/p&gt;
&lt;p&gt;The choice of FlickR as source of topic was motivated by the fact that such social media platform had a high quality amateur pictures. This personal involvement serves our goal as we are interested mainly to personal tweets.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Microblog Corpus&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A login is required to access the data, once registered to &lt;a href=&#034;http://clef2018-labs-registration.dei.unipd.it/&#034; class=&#034;spip_out&#034; rel=&#034;external&#034;&gt;CLEF&lt;/a&gt;&lt;/p&gt;
&lt;ul class=&#034;spip&#034; role=&#034;list&#034;&gt;&lt;li&gt; The complete stream of 70 000 000 microblogs is available &lt;a href=&#034;https://mc2.talne.eu/data/clef/&#034; class=&#034;spip_out&#034; rel=&#034;external&#034;&gt;here&lt;/a&gt; for registered participants. This document collection is provided by GAFES. Microblogs are provided with their meta-information and expanded URLs on a MySQL server. &lt;br class='autobr' /&gt;
Due to legal terms the access to this database is restricted to registered participants under privacy agreement.&lt;/li&gt;&lt;li&gt; An [indri Index with a web interface-&lt;a href=&#034;https://mc2.talne.eu/data/clef/api&#034; class=&#034;spip_url spip_out auto&#034; rel=&#034;nofollow external&#034;&gt;https://mc2.talne.eu/data/clef/api&lt;/a&gt;] are available to query the whole set of microblogs&lt;/li&gt;&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Evaluation&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The official evaluation measure is NDCG.&lt;/p&gt;
&lt;p&gt;This ranking measures will give a score for each retrieved tweet with a discount function over the rank. As we are mostly interested in top ranked arguments, this ranking measures meet our expectation.&lt;br class='autobr' /&gt;
This measure was also used in TREC Microblog Track [1]. A tweet is considered as highly relevant when it is a personal and contains an argument that directly refers to the festival (topic).&lt;/p&gt;
&lt;p&gt;&lt;i&gt;[1] Overview of the TREC-2015 Microblog Track&lt;br class='autobr' /&gt;
Jimmy Lin,Miles Efron, Yulu Wang, Garrick sherman, Ellen Voorhees&lt;/i&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Result Submission&lt;/strong&gt;&lt;br class='autobr' /&gt;
The runs must respect the classical trec top files format as describe above. Only the top 100 results for each query run must be given. Each run in each language, must contain 3 fields:
&lt;br /&gt;&lt;span class=&#034;spip-puce ltr&#034;&gt;&lt;b&gt;&#8211;&lt;/b&gt;&lt;/span&gt; Id : a long integer representation of the unique identifier of this Tweet
&lt;br /&gt;&lt;span class=&#034;spip-puce ltr&#034;&gt;&lt;b&gt;&#8211;&lt;/b&gt;&lt;/span&gt; Scores : The probability of being an argument tweet accorded by participant system &lt;br /&gt;&lt;span class=&#034;spip-puce ltr&#034;&gt;&lt;b&gt;&#8211;&lt;/b&gt;&lt;/span&gt; Rank : The accorded position of the tweet in the grading list of argument tweets
&lt;br /&gt;&lt;span class=&#034;spip-puce ltr&#034;&gt;&lt;b&gt;&#8211;&lt;/b&gt;&lt;/span&gt; Content: Microblog textual content&lt;/p&gt;
&lt;p&gt;Diversity criteria: &lt;br class='autobr' /&gt;
The more a run detects different arguments about a cultural event, the more it is interesting.&lt;/p&gt;
&lt;p&gt;Exemples about &#034;Cannes festival name:
&lt;br /&gt;&lt;span class=&#034;spip-puce ltr&#034;&gt;&lt;b&gt;&#8211;&lt;/b&gt;&lt;/span&gt; I ve seen some people saying they're boycotting Cannes &lt;i&gt;because of the high heels rule&lt;/i&gt;. I'm not sure they'll notice.
&lt;br /&gt;&lt;span class=&#034;spip-puce ltr&#034;&gt;&lt;b&gt;&#8211;&lt;/b&gt;&lt;/span&gt; Not going to lie, one of my favorite things about the Cannes festival &lt;i&gt;is all of these handsome men in tuxedos.&lt;/i&gt;
&lt;br /&gt;&lt;span class=&#034;spip-puce ltr&#034;&gt;&lt;b&gt;&#8211;&lt;/b&gt;&lt;/span&gt; Cannes is relevant because &lt;i&gt;movies get timed standing ovations&lt;/i&gt;.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How to get the data?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;To get an access to the Microblog corpus, email malek.hajjem@univ-avignon.fr or registered to &lt;a href=&#034;http://clef2018-labs-registration.dei.unipd.it/&#034; class=&#034;spip_out&#034; rel=&#034;external&#034;&gt;CLEF&lt;/a&gt;&lt;br class='autobr' /&gt;
The English topics can be downloaded &lt;a href=&#034;https://mc2.talne.eu/~t17malek/mc2_2018_t2/opinion/en/arg/data_t2_sample/English-topics.csv&#034; class=&#034;spip_out&#034; rel=&#034;external&#034;&gt;here&lt;/a&gt; &lt;br class='autobr' /&gt;
The French topics can be downloaded &lt;a href=&#034;https://mc2.talne.eu/~t17malek/mc2_2018_t2/opinion/en/arg/data_t2_sample/French_topics.csv&#034; class=&#034;spip_out&#034; rel=&#034;external&#034;&gt;here&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Contact Information&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;If you have any question, email us through this address mail : malek.hajjem@univ-avignon.fr&lt;/p&gt;&lt;/div&gt;
		
		</content:encoded>


		

	</item>
<item xml:lang="en">
		<title>Towards Argumentative Ranking </title>
		<link>https://talne.eu/spip/spip.php?page=article&amp;id_article=18</link>
		<guid isPermaLink="true">https://talne.eu/spip/spip.php?page=article&amp;id_article=18</guid>
		<dc:date>2018-02-08T18:17:15Z</dc:date>
		<dc:format>text/html</dc:format>
		<dc:language>en</dc:language>
		<dc:creator>Chiraz Latiri, Julio Gonzalo, Malek Hajjem</dc:creator>



		<description>
&lt;p&gt;Organizers: &lt;br class='autobr' /&gt;
Chiraz Latiri, Julio Gonzalo, Malek Hajjem &lt;br class='autobr' /&gt;
Task 2 participation deadline April 30, 2018 &lt;br class='autobr' /&gt;
Argumentative Ranking of Microblogs &lt;br class='autobr' /&gt;
Argumentation mining is a new problem in corpus-based text analysis that addresses the challenging task of automatically identifying the justifications provided by opinion holders for their judgment. Several approaches of argumentation mining have been proposed so far in areas such as legal documents, on-line debates, product reviews, newspaper (&#8230;)&lt;/p&gt;


-
&lt;a href="https://talne.eu/spip/spip.php?page=rubrique&amp;id_rubrique=12" rel="directory"&gt;2- Mining opinion argumentation&lt;/a&gt;


		</description>


 <content:encoded>&lt;div class='rss_texte'&gt;&lt;p&gt;&lt;strong&gt;Organizers: &lt;br class='autobr' /&gt;
&lt;/strong&gt;&lt;br class='autobr' /&gt;
&lt;i&gt;Chiraz Latiri, Julio Gonzalo, Malek Hajjem&lt;/i&gt;&lt;/p&gt;
&lt;p&gt;Task 2 participation deadline &lt;strong&gt;April 30, 2018&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Argumentative Ranking of Microblogs&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Argumentation mining is a new problem in corpus-based text analysis that addresses the challenging task of automatically identifying the justifications provided by opinion holders for their judgment. Several approaches of argumentation mining have been proposed so far in areas such as legal documents, on-line debates, product reviews, newspaper articles and court cases, as well as in dialogical domains. &lt;br class='autobr' /&gt;
With the popularization of social networks, argumentation mining is considered as an extension of the opinion mining issue from social network content. The aim is to automatically identify reason-conclusion structures that can lead to model social web user's positions about a service or an event expressed through social networks platforms like Twitter. Indeed, when we need to form an opinion on a new topic or make a decision, arguments will be all what we are looking for.&lt;br class='autobr' /&gt; To make argumentation structures available, in case of Twitter, a robust automatic recognition of it is required, based on resources that should be created in a reproducible fashion to be reliable. However, the ambiguity of natural language text produced in social media, with different writing styles, implicit context and heterogeneous content make argumentation, on Twitter, very challenging.&lt;/p&gt;
&lt;p&gt;Another possible way to pick up the argumentation structures, from a generic tweet corpus, is to use approaches based on information extraction. The idea is to perform a search process that focus on claims about a given topic out in a massive collection. This approach relates to the field of focused retrieval, that aims to provide users with direct access to relevant information in retrieved documents. In this task, relevant information is expressed in the form of arguments. [1]&lt;/p&gt;
&lt;p&gt;Success of such argumentation ranking will require interdisciplinary approaches based on the combination of different research issues. In fact, to better understand a short text and be able to detect the argumentative structures within a microblog, we could restore a &#171; text contextualization &#187; as a way to provide more information on the corresponding text [2]. Providing such information in order to detect argumentative tweets, would highlight relevant ones, in other words, tweets expressed in the form of arguments. Thus, argumentation mining in this situation will tend to act in the same way of an Information Retrieval (IR) system where potential argumentative tweets had to come first. Similar approach that addresses such purpose is presented in [3], where the output of the priority task will be a ranking of tweets according to their probability of being a potential threat to the reputation of some entity.&lt;/p&gt;
&lt;p&gt;&lt;i&gt;[1] Argumentative Ranking&lt;br class='autobr' /&gt;
Marco Lippi and Paolo Sarti and Paolo Torroni DISI - Universita degli Studi di Bologna Proceedings of Natural Language Processing meets Journalism - IJCAI-16 Workshop (NLPMJ 2016), New York, (July 2016)&lt;br class='autobr' /&gt;
[2] INEX Tweet Contextualization task : Evaluation, results and lesson learned&lt;br class='autobr' /&gt;
Patrice Bellot, V&#233;ronique Moriceau, Josiane Mothe, Eric SanJuan, Xavier Tannier:- Inf. Process. Manage. 52(5): 801-819 (2016)&lt;br class='autobr' /&gt;
[3] Overview of RepLab 2013: Evaluating Online Reputation Monitoring Systems&lt;br class='autobr' /&gt;
Enrique Amigo, Jorge Carrillo de Albornoz, Irina Chugur, Adolfo Corujo Julio Gonzalo, Tamara Martin, Edgar Meij Maarten de Rijke and Damiano Spina&lt;/i&gt;&lt;/p&gt;&lt;/div&gt;
		
		</content:encoded>


		

	</item>



</channel>

</rss>
