TempEval-2
Evaluating Events, Time Expressions, and Temporal Relations
This is the home page for Tempeval-2, task #13 of the Semeval-2010 competition. Tempeval-2 is a follow-up on Tempeval-1, which was an initial evaluation exercise based on three limited temporal relation tasks.
Latest News
- March 17th, 2010: released patch for the HTML data, which were showing the wrong attributes. The patch contains html versions of the data for Chinese, English, Italian and Spanish.
- March 11th, 2010: training data for Chinese, English, French, Italian and Spanish released to SemEval.
Summary
Newspaper texts, narratives and other texts describe events occurring in time, explicitly and implicitly specifying the temporal location and order of these events. Text comprehension requires the capability to identify the events described in a text and to locate them in time.
We provide three tasks that are relevant to understanding the temporal structure of a text: (i) identification of events, (ii) identification of time expressions and (iii) identification of temporal relations. The temporal relations task is further structured into four sub tasks, requiring systems to recognize which of a fixed set of temporal relations holds between (a) events and time expressions within the same sentence (b) events and the document creation time (c) main events in consecutive sentences, and (d) two events where one syntactically dominates the other.
The annotation scheme used is based on TimeML. TimeML (http://www.timeml.org) has been developed over the last decade as a general multilingual markup language for temporal information in texts and has been accepted as an ISO standard.
Data sets will be provided for six languages: English, Italian, Spanish, Chinese, Korean and French. The data sets do not comprise a parallel corpus. Sizes may range from 25K to 150K tokens. Participants can choose any combination of the three main tasks and the five languages.
Please read the task proposal (pdf file) for some more background and a slightly more detailed overview of the tasks.
Trial Data
The trial data are now available for English and Italian. The release notes are available on line, the trial data themselves are posted on the Semeval-2010 website.
Training Data
Training data are now available for Chinese, English, French, Italaan and Spanish. The release notes are available on line, the training data themselves will be posted on the Semeval-2010 website. There will be a second batch of training data on March 28th, see the release notes for more details.
Mailing List
Last updated: October 16th, 2009.