Andrej Žgank


2016

The SI TEDx-UM speech database: a new Slovenian Spoken Language Resource
Andrej Žgank | Mirjam Sepesy Maučec | Darinka Verdonik
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)

This paper presents a new Slovenian spoken language resource built from TEDx Talks. The speech database contains 242 talks with a total duration of 54 hours. The annotation and transcription of the acquired spoken material were generated automatically, applying acoustic segmentation and automatic speech recognition. The development and evaluation subset was also manually transcribed using the guidelines specified for the Slovenian GOS corpus. The manual transcriptions were used to evaluate the quality of the unsupervised transcriptions. The average word error rate for the SI TEDx-UM evaluation subset was 50.7%, with an out-of-vocabulary rate of 24% and a language model perplexity of 390. The unsupervised transcriptions contain 372k tokens, of which 32k are distinct.
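As an illustration of the two metrics quoted in the abstract above, the following is a minimal sketch of how word error rate and out-of-vocabulary rate are conventionally defined. It is not the scoring procedure used by the authors, which the abstract does not describe; the function names `word_error_rate` and `oov_rate`, the whitespace tokenisation, and the toy inputs are all assumptions made for this example.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / reference length.

    Computed as the word-level Levenshtein distance between the two
    transcripts, divided by the number of reference words.
    Tokenisation by whitespace is an assumption of this sketch.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference transcript must not be empty")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


def oov_rate(tokens: list[str], vocabulary: set[str]) -> float:
    """Fraction of running tokens that are missing from the vocabulary."""
    if not tokens:
        raise ValueError("token list must not be empty")
    missing = sum(1 for token in tokens if token not in vocabulary)
    return missing / len(tokens)


if __name__ == "__main__":
    # Toy inputs, not data from the SI TEDx-UM database.
    print(word_error_rate("a b c d", "a x c"))
    print(oov_rate(["a", "b", "c", "d"], {"a", "b", "c"}))
```

Under these definitions, a word error rate of 50.7% means that the number of word-level edits needed to turn the automatic transcript into the manual one is about half the number of words in the manual transcript.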

2014

The Slovene BNSI Broadcast News database and reference speech corpus GOS: Towards the uniform guidelines for future work
Andrej Žgank | Ana Zwitter Vitez | Darinka Verdonik
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)

The aim of the paper is to search for common guidelines for the future development of speech databases for less-resourced languages, so that they are as useful as possible for both of their main fields of use: linguistic research and speech technologies. We compare two standards for creating speech databases, one followed when developing the Slovene speech database for automatic speech recognition (BNSI Broadcast News), the other followed when developing the Slovene reference speech corpus GOS, and outline possible common guidelines for future work. We also present an add-on for the GOS corpus that enables its use for automatic speech recognition.

2006

SINOD - Slovenian non-native speech database
Andrej Žgank | Darinka Verdonik | Aleksandra Zögling Markuš | Zdravko Kačič
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)

This paper presents the SINOD database, which is the first Slovenian non-native speech database. It will be used to improve the performance of a large vocabulary continuous speech recogniser for non-native speakers. The main quality impact is expected for the acoustic models and the recogniser’s vocabulary. The SINOD database is designed as a supplement to the Slovenian BNSI Broadcast News database. The same BN recommendations were used for both databases. Two interviews with non-native Slovenian speakers were incorporated in the set. Both non-native speakers were female, whereas the journalist was a native Slovenian male speaker. The transcription approach applied in the production phase is presented. Different statistics and analyses of the database are given in the paper.

2004

Acquisition and Annotation of Slovenian Broadcast News Database
Andrej Žgank | Tomaž Rotovnik | Mirjam Sepesy Maučec | Darinka Verdonik | Janez Kitak | Damjan Vlaj | Vladimir Hozjan | Zdravko Kačič | Bogomir Horvat
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)

The COST 278 MASPER Initiative - Crosslingual Speech Recognition with Large Telephone Databases
Andrej Žgank | Zdravko Kačič | Frank Diehl | Klara Vicsi | Gyorgy Szaszak | Jozef Juhar | Slavomir Lihan
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)

2002

Preliminary Evaluation of Slovenian Mobile Database PoliDat
Andrej Žgank | Zdravko Kačič | Bogomir Horvat
Proceedings of the Third International Conference on Language Resources and Evaluation (LREC’02)

2000

The COST 249 SpeechDat Multilingual Reference Recogniser
Finn Tore Johansen | Narada Warakagoda | Børge Lindberg | Gunnar Lehtinen | Zdravko Kačič | Andrej Žgank | Kjell Elenius | Giampiero Salvi
Proceedings of the Second International Conference on Language Resources and Evaluation (LREC’00)