how_to_obtain_licenses_for_the_shared_task_data

Differences

This shows you the differences between two versions of the page.

how_to_obtain_licenses_for_the_shared_task_data [2014/05/20 01:34]
seddah
how_to_obtain_licenses_for_the_shared_task_data [2014/05/26 04:39] (current)
seddah
Line 8: Line 8:
 === 1. Arabic Treebank ===
  
-The Arabic Treebank is distributed by the LDC. In order to obtain, please download the following license, fill it out, sign it, and fax it to LDC, attention: Ilya Ahtaridis, fax number +1 215-573-2175. Make sure that you include your email address! Alternatively you can also mail a signed scanned copy of the licence to ldc@ldc.upenn.edu (with object "[SPMRL 2013 Shared task] Arabic data set").
+The Arabic Treebank is distributed by the LDC. In order to obtain, please download the following license, fill it out, sign it, and fax it to LDC, attention: Ilya Ahtaridis, fax number +1 215-573-2175. Make sure that you include your email address! Alternatively you can also mail a signed scanned copy of the licence to ldc@ldc.upenn.edu (with object "[SPMRL 2014 Shared task] Arabic data set").
-**Please note that the Arabic data set is not yet available so please do not mail the LDC yet. You'll receive a mail through the mailing list when they'll be ready (next week as of June 6 2013).**
+**Please note that the Unlabeled Arabic data set is not yet available so please do not mail the LDC yet. You'll receive a mail through the mailing list when it'll be ready (next week as of May 26 2014).**
  
  
-Arabic: {{:dlall:arabicldclicense.pdf|}} [[http://pauillac.inria.fr/~seddah/arabicldclicense.pdf|alternate download]]
+Arabic: {{Arabic.pdf|}} [[http://pauillac.inria.fr/~seddah/spmrl2014_evaluation_agreement.pdf|alternate download]]
  
 ----
Line 22: Line 22:
  
  
-French: {{:dlall:FrenchTreebank_License.pdf|}} [[http://pauillac.inria.fr/~seddah/frenchtreebank_license.pdf|alternate download]]
+French: {{French.pdf|}} [[http://pauillac.inria.fr/~seddah/French.pdf|alternate download]]
 
-Hungarian: {{:dlall:hungarian.pdf|}} [[http://pauillac.inria.fr/~seddah/hungarian.pdf|alternate download]]
+Hungarian: {{hungarian.pdf|}} [[http://pauillac.inria.fr/~seddah/hungarian.pdf|alternate download]]
        
 ----
Line 39: Line 39:
 All these treebanks will be distributed with the licensed treebanks. In order to obtain them, fill out the following form, stating that you will use the data set only for the shared task, scan it, and send it to spmrl.sharedtask@gmail.com. Do not send this form to the LDC!
  
-general form: {{:dlall:generalform2.pdf|}} [[http://pauillac.inria.fr/~seddah/generalform2.pdf|alternate download]]
+general form: {{generalform2.pdf|}} [[http://pauillac.inria.fr/~seddah/generalform2.pdf|alternate download]]
  
  
Line 46: Line 46:
  
 ==== How to Obtain The Unlabeled Data Sets ====
-Most of the unlabeled data sets we use (at the exception of the French, Hebrew, German, Polish which are covered by the creative common license -- cc-by-nc-sa-- and Basque, specific research-only license) are subjected to the same license as their treebank counterparts.
+Most of the unlabeled data sets we use (at the exception of the French, Hebrew, German, Polish which are covered by the creative common license -- cc-by-nc-sa-- and Basque, specific research-only license) are subjected to the same license as their treebank counterparts. Once the shared task completed, all free-licensed data will be made available.
 + 
 +=== 1. Licensed Unlabeled Data ===
  
 For the shared task duration, All data, but Arabic, will be made available through the restricted access download page. The unlabeled arabic data will be made available via the account provided by the LDC.
-After the shared task, all free-licensed data will be made available. 
-  
  
----- 
  
-=== In Case of Problems ===
+Arabic unlabeled data:  subjected to the same license as the Arabic treebank data set.
 + 
 +Basque Unlabeled data:  {{basque-corpus-agreement.pdf|Basque license}}[[http://pauillac.inria.fr/~seddah/Basque-Corpus-Agreement.pdf|alternate download]]
 + 
 + 
 +=== 2. Free for research Licensed Unlabeled Data === 
 + 
 +Hungarian, Korean and Swedish: (see General form above) 
 + 
 + 
 +=== 3. Openly Licensed Unlabeled Data === 
 + 
 + 
 +Note that at the exception of the French (based on the Est Republicain corpus, governed by the cc-by-nc-sa licence) and Hebrew (cc-by-sa 4.0), the raw text of the wikidumps (German, Polish) is subjected to the cc-by-sa license. The status of their added annotations will be explicited shortly.
 +
 +French: [[http://creativecommons.org/licenses/by-nc-sa/2.0/|cc-by-nc-sa 2.0]]
 +
 +German,Polish: [[http://creativecommons.org/licenses/by-sa/3.0/|cc-by-sa 3.0]]
 +
 +Hebrew: [[http://creativecommons.org/licenses/by-sa/4.0/|cc-by-sa 4.0]]
 + 
 +For further references, those precisions are included in the general form cited above. 
 + 
 + 
 + 
 + 
 +---- 
 +==== In Case of Problems ====
  
-In case of problems, contact spmrl.sharedtask@gmail.com. If you cannot scan the licenses, we can provide a fax number.
+In case of problems, contact **spmrl.sharedtask@gmail.com**. If you cannot scan the licenses, we can provide a fax number.
how_to_obtain_licenses_for_the_shared_task_data.1400542447.txt.gz · Last modified: 2014/05/20 01:34 by seddah