Morpho Challenge 2010 – Semi-supervised and Unsupervised Analysis
Part of the EU Network of Excellence PASCAL2 Challenge Program.
Participation is open to all.
The objective of the Challenge is to design a statistical machine
learning algorithm that discovers which morphemes (smallest individually
meaningful units of language) words consist of. Ideally, these are basic
vocabulary units suitable for different tasks, such as text
understanding, machine translation, information retrieval, and
statistical language modeling.
The scientific goals are:
* To learn of the phenomena underlying word construction in natural
  languages
* To discover approaches suitable for a wide range of languages
* To advance machine learning methodology
Morpho Challenge 2010 is a follow-up to our previous Morpho Challenges
of 2005, 2007, 2008 and 2009. The task in 2010 is similar to that of
2009, where the aim was to find the morpheme analysis of the word forms
in the data. As a new task, we offer semi-supervised learning using the
available linguistic gold standard morpheme analyses.
Participation in the previous challenges is by no means a prerequisite
for participation in Morpho Challenge 2010. Everyone is welcome and we
hope to attract many participating teams. The results will be presented
at a workshop held at our university on 2-3 September 2010. Please
read the rules and see the schedule on the home page.
If you have decided to participate in Morpho Challenge, please contact
the organizers and ask to be added to our mailing list. We will use this
mailing list to provide news about the tasks, data and evaluations.
We are looking forward to an interesting challenge!
Mikko Kurimo, Krista Lagus, Sami Virpioja and Ville Turunen
Adaptive Informatics Research Centre, Aalto University (previously
known as Helsinki University of Technology)