Difference between revisions of "OLR Challenge 2019"

From cslt Wiki
Jump to: navigation, search
Line 96: Line 96:
 
CeaFos
 
CeaFos
  
[媒体文件:Ceafos.jpg]
+
[[文件:Ceafos.jpg]]

Revision as of 09:19, 17 July 2019

Oriental Language Recognition (OLR) 2019 Challenge

Oriental languages involve interesting specialties. The OLR challenge series aim at boosting language recognition technology for oriental languages. Following the success of OLR Challenge 2018, OLR Challenge 2017 and OLR Challenge 2016, the new challenge in 2019 follows the same theme, but sets up more challenging tasks in the sense of:

  • Task 1: Short-utterance LID, where the test utterances are as short as 1 second.
  • Task 2: Cross-channel LID, where test data is in different channels from the training set.
  • Task 3: Zero-resource LID, where no resources are provided for training before inference, but several reference utterances are provided for each language.

We will publish the results on a special session of APSIPA ASC 2019.


News

  • Challenge registration open.

Data

The challenge is based on two multilingual databases, AP16-OL7 that was designed for the OLR challenge 2016, and AP17-OL3 database that was designed for the OLR challenge 2017.

AP16-OL7 is provided by SpeechOcean (www.speechocean.com), and AP17-OL3 is provided by Tsinghua University, Northwest Minzu University and Xinjiang University, under the M2ASR project supported by NSFC.

The features for AP16-OL7 involve:

  • Mobile channel
  • 7 languages in total
  • 71 hours of speech signals in total
  • Transcriptions and lexica are provided
  • The data profile is here
  • The License for the data is here

The features for AP17-OL3 involve:

  • Mobile channel
  • 3 languages in total
  • Tibetan provided by Prof. Guanyu Li@Northwest Minzu Univ.
  • Uyghur and Kazak provided by Prof. Askar Hamdulla@Xinjiang University.
  • 35 hours of speech signals in total
  • Transcriptions and lexica are provided
  • The data profile is here
  • The License for the data is here

Evaluation plan

Refer to the following paper:

Zhiyuan Tang, Dong Wang, Liming Song: AP19-OLR Challenge: Three Tasks and Their Baselines, submitted to APSIPA ASC 2019. (online version coming soon)

Evaluation tools

  • The Kaldi-based baseline scripts here

Participation rules

  • Participants from both academy and industry are welcome
  • Publications based on the data provided by the challenge should cite the following paper:

Dong Wang, Lantian Li, Difei Tang, Qing Chen, AP16-OL7: a multilingual database for oriental languages and a language recognition baseline, APSIPA ASC 2016. pdf

Zhiyuan Tang, Dong Wang, Yixiang Chen, Qing Chen: AP17-OLR Challenge: Data, Plan, and Baseline, APSIPA ASC 2017. pdf

Zhiyuan Tang, Dong Wang, Qing Chen: AP18-OLR Challenge: Three Tasks and Their Baselines, submitted to APSIPA ASC 2018. pdf

Zhiyuan Tang, Dong Wang, Liming Song: AP19-OLR Challenge: Three Tasks and Their Baselines, submitted to APSIPA ASC 2019. (online version coming soon)

Important dates

  • July. 16, AP19-OLR training/dev data release.
  • Oct. 1, register deadline.
  • Oct. 20, test data release.
  • Nov. 1, 24:00, Beijing time, submission deadline.
  • APSIPA ASC 2019 (18 Nov), results announcement.

Registration procedure

If you intend to participate the challenge, or if you have any questions, comments or suggestions about the challenge, please send email to the organizers ( olr19@cslt.org). For participants, the following information is required:

 - Team Name: 
 - Institute: 
 - Participants: 
 - Duty person: 
 - Hompage of person/organization/company:
 (If homepage is not available, any of your online papers in speech field are feasible.)

Organization Committee

  • Dong Wang, Tsinghua University [home]
  • Zhiyuan Tang, Tsinghua University [home]
  • Liming Song, SpeechOcean
  • More professionals...


Sponsors

CeaFos

Ceafos.jpg