Annotation

Only a few of us could imagine that obtaining information from TV broadcast is one of the basic problems of the hearing impaired. In the present time there is no equivalent access for the given group of people to the television broadcast content as it is in the case of the hearing population. Within the meaning of the legislation (Law no. 373/2013 of the Code from October 20. 2013), broadcaster is obliged to ensure multimodal approach to the digital broadcast service in a way that at least 50% is accompanied with open or closed captions corresponding with the content of the program. In a similar way, at least 10% is obligatory in the case of the licensed broadcasters. Recently, the European Federation of Hard of Hearing People (EFHOH) is pushing ahead idea to enhance ratio of the programs accompanied by open or closed captions to 100% in each EU member state. Reaching the desired goal in Slovakia using the current approach of subtitling the audiovisual content would mean spending huge amount of financial resources by the television broadcaster, because manufacturing of the closed captions is subject of laborious manual transcription of the spoken words to text by certified workers and consecutive adjustment specified by the requirements of the edict of the Ministry of the Culture of the Slovak Republic. The only economically viable option is to head towards utilization of the automatic spontaneous speech recognition and to apply modern principles and methods of the speech technologies in automatic transcription of spoken words to text. The main goal of this project proposal is applied research in the area of the natural speech processing and development of a customized pilot system for automatic subtitling of audiovisual content based on large vocabulary continuous speech recognition. Results of the applied research are going to be a base of development of system solutions (software application or service) for automatic subtitling in Slovak.

Project Objectives

The aim of this project is an applied research in the area of natural speech and language processing in order to design and develop a system specifically to automatic subtitling of audiovisual content for deaf and hard of hearing people based on automatic large vocabulary continuous speech recognition. For purpose of successful project completion following tasks are required to be fulfilled and realized:

systematic collection, processing and annotation of a corpus of new speech and text data including speaker identification, topic classification, speech style, and acoustic environment influence for purpose of existing speech and text corpora extension and subsequent actualization and adaptation of a large vocabulary continuous speech recognition system to the area of newscast and TV news broadcast;

design and develop system core for large vocabulary continuous speech recognition placed on computing server with subsequent adaptation all of its components (acoustic and language models, and vocabularies) for the automatic subtitling of an audiovisual content for live broadcast in the Slovak language;

applied research and development of advanced methods for acoustic model to speaker's gender and voice adaptation, language model to topic and speaker's speaking style adaptation, and dynamic vocabulary update on regular basis;

the comprehensive evaluation using different deployment scenarios including specific conditions simulating real environment in order to determine influence level of developed methods and approaches to resulting accuracy of the automatic transcription in automatic closed captioning of audiovisual content in the Slovak language.

Results of the applied research are going to be a base of development of system solution (in the form of a software application or service) for automatic subtitling of audiovisual content in Slovak provided to people with hearing impairments.

Members

prof. Ing. Jozef Juhár, CSc.

Principal Investigator, FEEI TUKE

Dr.h.c. prof. Ing. Anton Čižmár, CSc.

Full Professor, FEEI TUKE

Ing. Ján Staš, PhD.

Assistant Proffesor, FEEI TUKE

Ing. Daniel Hládek, PhD.

Assistant Proffesor, FEEI TUKE

Ing. Stanislav Ondáš, PhD.

Assistant Proffesor, FEEI TUKE

Ing. Matúš Pleva, PhD.

Research Assistant, FEEI TUKE

Ing. Martin Lojka, PhD.

Researcher, FEEI TUKE

Ing. Peter Viszlay, PhD.

Researcher, FEEI TUKE

Ing. Tomáš Koctúr

PhD. Student, FEEI TUKE

Ing. Dávid Čonka

PhD. Student, FEEI TUKE

Ing. Jozef Greššák

PhD. Student, FEEI TUKE

Ing. Milan Rusko, PhD.

Co-Principal Investigator, II SAS BA

Ing. Sakhia Darjaa, PhD.

Research Assistant, II SAS BA

Ing. Marián Trnka

Research Assistant, II SAS BA

Mgr. Robert Sabo, PhD.

Research Assistant, II SAS BA

RNDr. Marian Ritomský

Research Assistant, II SAS BA

Ing. Igor Guoth

Research Assistant, II SAS BA

doc. Mgr. Štefan Beňuš, PhD.

Associate Proffesor, FF UKF Nitra

Support Us

Slovak Research and Development Agency

Markíza

Slovakia, spol. s r.o.

Ministry of Culture of the Slovak Republic

EFFETA

Centre of St. Francis of Sales

SNEPEDA

Association of the Deaf Educators

Myslím

Deaf Culture Centre

PUBLICATIONS AND POPULARIZATION

Publications

Koctúr, T., Viszlay, P., Staš, J., Lojka, M.: Automatická tvorba rečových korpusov založená na komplementárnosti dvoch systémov na automatické rozpoznávanie plynulej reči v slovenčine. In: Electrical Engineering and Informatics VII: Proc. of the Faculty of Electrical Engineering and Informatics of the Technical University of Košice, 08 September 2016, ISBN 978-80-553-2599-6, pp. 87-92.
Lojka, M., Juhár, J.: Kombinácia systémov na rozpoznávanie reči spájaníám hypotéz. In: Electrical Engineering and Informatics VII: Proc. of the Faculty of Electrical Engineering and Informatics of the Technical University of Košice, 08 September 2016, ISBN 978-80-553-2599-6, pp. 275-278.
Hiľovský, M., Greššák, J., Lojka, M., Juhár, J.: MAPL – Microphone array processing library. In: Proc. of the 58th International Symposium ELMAR 2016, Zadar, Croatia, 12-14 September 2016, ISBN 978-953-184-221-1, ISSN 1334-2630, pp. 27-30.
Koctúr, T., Staš, J., Juhár, J.: Unsupervised acoustic corpora building based on variable confidence measure thresholding. In: Proc. of the 58th International Symposium ELMAR 2016, Zadar, Croatia, 12-14 September 2016, ISBN 978-953-184-221-1, ISSN 1334-2630, pp. 31-34.
Staš, J., Hládek, D., Juhár, J.: Adding filled pauses and disfluent events into language models for speech recognition. In: Proc. of the 7th IEEE International Conference on Cognitive InfoCommunications, CogInfoCom 2016, Wroclaw, Poland, 16-18 October 2016, ISBN 978-1-5090-2643-2, pp. 133-137.
Ondáš, S., Macková, L., Hládek, D.: Emotion analysis in DiaCoSk dialog corpus. In: Proc. of the 7th IEEE International Conference on Cognitive InfoCommunications, CogInfoCom 2016, Wroclaw, Poland, 16-18 October 2016, ISBN 978-1-5090-2643-2, pp. 151-155.
Staš, J., Koctúr, T., Viszlay, P.: Automatická anotácia a tvorba rečového korpusu prednášok TEDxSK a JumpSK. In: Proc. of the 11th Workshop on Intelligent and Knowledge Oriented Technologies and 35th Conference on Data and Knowledge, WIKT & DaZ 2016, Smolenice, Slovakia, 3-4 November 2016, ISBN 978-80-227-4619-9, pp. 127-132.
Hládek, D., Staš, J., Pleva, M., Ondáš, S., Kovács, L.: Survey of the word sense disambiguation and challenges for the Slovak language. In: Proc. of the 17th IEEE International Symposium on Computational Intelligence and Informatics, CINTI 2016, Budapest, Hungary, 17-19 November 2016, ISBN 978-1-5090-3908-1, pp. 225-229. (AFC)
Hládek, D., Staš, J., Ondáš, S., Juhár, J., Kovács, Z. (2017) Learning string distance with smoothing for OCR spelling correction. Multimedia Tools and Applications, Volume 76, Issue 22, ISSN 1380-7501, pp. 24549-24567 (Current Content, ISI Thomson IF=1.530).
Pleva, M., Bours, P., Ondáš, S., Juhár, J. (2017) Improving static audio keystroke analysis by score fusion of acoustic and timing data. Multimedia Tools and Applications, Volume 76, Issue 24, ISSN 1380-7501, pp. 25749- 25766 (Current Content, ISI Thomson IF=1.530).
Vavrek, J., Feciľák, P., Juhár, J., Čižmár, A. (2017) Classification of broadcast news audio data employing binary decision architecture. Computing and Informatics, Volume 36, Issue 4, ISSN 1335-9150, pp. 857-886 (Current Content, ISI Thomson IF=0.504).
Hládek, D., Staš, J., Juhár, J. (2017) Crowdsourcing language resources for speech recognition. Science. Business. Society, Vol. 2, No. 3, ISSN 2367-8380, pp. 139-142.
Guoth, I., Rusko, M., Ritomský, M., Trnka, M., Darjaa, S. (2017) Exploitation of phase-based features for emotional arousal evaluation from speech (Abstract). The Journal of the Acoustic Society of America, Vol. 141, No. 5, ISSN 0001-4966, DOI10.1121/1.4987206, page 3468 (ISI Thomson IF=1.547).
Rusko, M., Trnka, M., Darjaa, S., Ritomský, M., Guoth, I. (2017) Influence of noise on the speaker verification in the air traffic control voice communication (Abstract). The Journal of the Acoustic Society of America, Vol. 141, No. 5, ISSN 0001-4966, DOI10.1121/1.4987206, page 3469 (ISI Thomson IF=1.547).
Staš, J., Hládek, D., Viszlay, P., Koctúr, T. (2017) TEDxSK and JumpSK: A new speech recognition dedicated corpus. Journal of Linguistics, Vol. 68, No. 2, ISSN 1338-4287, pp. 346-354.
Koctúr, T., Ondáš, S., Juhár, J. (2017) Speech corpus generation based on n-gram confidence measure classification. In: Proc. of the 59th International Symposium ELMAR 2017, Zadar, Croatia, ISBN 978-953- 184-230- 3, ISSN 1334-2630, pp. 149-152.
Staš, J., Hládek, D., Juhár, J. (2017) Semantic indexing and document retrieval for personalized language modeling. In: Proc. of the 59th International Symposium ELMAR 2017, Zadar, Croatia, ISBN 978-953- 184-230- 3, ISSN 1334- 2630, pp. 157-161.
Turabzadeh, S., Meng, H., Swash, R.M., Pleva, M., Juhár, J. (2017) Real-time emotional state detection from facial expression on embedded devices. In: Proc. of the 7th International Conference on Innovative Computing Technology, INTECH 2017, Luton, London, ISBN 978-1- 5090-3990- 6, pp. 46-51.
Ondáš, S., Juhár, J., Pleva, M., Ferčák, P., Husovský, R. (2017) Multimodal dialogue system with NAO and VoiceXML dialogue manager. In: Proc. of the 8th IEEE International Conference on Cognitive Infocommunications, CogInfoCom 2017, Debrecen, Maďarsko, ISBN 978-1- 5386-1264- 4, pp. 439-444.
Vizslay, P., Staš, J., Lojka, M., Greššák, J., Juhár, J., Gereg, S. (2017) Multi-conditionally trained ASR system for reverberant speech captured by spherical microphone array in adverse acoustic conditions. In: Proc. of the 8th Language & Technology Conference, LTC 2017, Poznań, Poland, ISBN 978-83- 64864-94- 0, pp. 251-256.
Čonka, D., Viszlay, P., Juhár, J. (2017) Detektor rečovej aktivity založený na hlbokej neurónovej sieti. In: Electrical Engineering and Informatics VIII: Proc. of the Faculty of Electrical Engineering and Informatics of the Technical University of Košice, Košice, Slovakia, ISBN 978-80- 553-3192- 8, pp. 311-313.
Viszlay, P., Gereg, S., Greššák, J., Juhár, J. (2017) Dereverberácia rečového signálu založená na párovaní časovo a spektrálne dekorelovaných príznakov. In: Electrical Engineering and Informatics VIII: Proc. of the Faculty of Electrical Engineering and Informatics of the Technical University of Košice, Košice, Slovakia, ISBN 978-80- 553- 3192-8, pp. 537-542.
Čonka, D. (2017) Deep neural network based on voice activity detector. In: Proc. of the 17th Scientific Conference of Young Researchers of Faculty of Electrical Engineering and Informatics Technical University of Košice, SCYR 2017, Košice, Slovakia, ISBN 978-80- 553-3162- 1, pp. 86-87.
Koctúr, T. (2017) Neural network error classification in dual ASR unsupervised acoustic corpora building. In: Proc. of the 17th Scientific Conference of Young Researchers of Faculty of Electrical Engineering and Informatics Technical University of Košice, SCYR 2017, Košice, Slovakia, ISBN 978-80- 553-3162- 1, pp. 172-173.
Greššák, J. (2017) Software library for microphone array signal processing. In: Proc. of the 17th Scientific Conference of Young Researchers of Faculty of Electrical Engineering and Informatics Technical University of Košice, SCYR 2017, Košice, Slovakia, ISBN 978-80- 553-3162- 1, pp. 216-217.
Ondáš, S., Pleva, M., Juhár, J., Husovský, R. (2017) Preliminary evaluation of the multimodal interactive system for NAO robot. In: Proc. of the 15th International Conference on Emerging eLearning Technologies and Applications, ICETA 2017, Starý Smokovec, Slovakia, ISBN 978-5386- 3294-9, pp. 337-342.
Hudson, Ch., Bethel, C.L., Carruth, D.W., Pleva, M., Juhár, J., Ondáš, S. (2017) A training tool for speech driven human-robot interaction applications. In: Proc. of the 15th International Conference on Emerging eLearning Technologies and Applications, ICETA 2017, Starý Smokovec, Slovakia, ISBN 978-5386- 3294-9, pp. 1-6.
Vavrek, J., Viszlay, P., Lojka, M., Juhár, J., Pleva, M. (2018) Weighted fast sequential DTW for multilingual audio Query-by-Example retrieval. Journal of Intelligent Information Systems, Volume 51, Issue 2, ISSN 0925-9920, pp. 439-455 (Current Content, ISI Thomson IF=1.107)
Staš, J., Viszlay, P., Lojka, M., Koctúr, T., Hládek, D., Juhár, J. (2018) Automatic transcription and subtitling of Slovak multi-genre audiovisual recordings. In: Human Language Technology, Challenges for Computer Science and Linguistics, Vetulani, Z., Mariani, J., Kubis, M. (Eds), LNCS, Volume 10930, Springer, Cham, ISBN 978-3-319-93781-6, pp. 42-56.
Lojka, M., Viszlay, P., Staš, J., Hládek, D., Juhár, J. (2018) Slovak broadcast news speech recognition and transcription system. In: Advances in Network-Based Information Systems, Barolli, L. et al. (Eds), LNDECT, Volume 22, Springer, Cham, ISBN 978-3-319-98529-9, pp. 385-394.
Staš, J., Hládek, D., Juhár, J. (2018) Modeling of filled pauses and prolongations to improve Slovak spontaneous speech recognition. In: Cognitive Infocommunications, Theory and Applications, Klempous, R., Nikodem, J., Baranyi, P. (Eds), TIEI, Volume 13, Springer, Cham, ISBN 978-3-319-95995-5, pp. 153-176.
Pleva, M., Ondas, S. (2018) Speech application for human-robot interaction systems. Problems of Engineering, Cybernetics and Robotics, Volume 69, ISSN 0204-9848, pp. 3-14.
Guoth, I., Rusko, M., Ritomský, M., Trnka, M., Darjaa, S. (2017) Identifying tense arousal in speech using phrase based features. Proceedings of Meetings on Acoustica, Volume 30, No. 1, ISSN 1939-800X, paper 130560.
Koctúr, T. (2018) 6-gram based filtration in unsupervised speech corpora building. In: Proc. of the 18th Scientific Conference of Young Researchers of Faculty of Electrical Engineering and Informatics Technical University of Košice, SCYR 2018, Košice, Slovakia, ISBN 978-80-553-2972-7, pp. 14-15.
Koctúrová, M., Juhár, J. (2018) Prehľad súčasných trendov v rozpoznávaní reči pomocou BCI. In: Electrical Engineering and Informatics IX: Proc. of the Faculty of Electrical Engineering and Informatics of the Technical University of Košice, Košice, Slovakia, ISBN 978-80-553-2713-6, pp. 543-547.
Liao, Y.-F., Wang, Y.-R. (2018) Some experiences on applying deep learning to speech signal and natural language processing. In: Proc. of IEEE World Symposium on Digital Intelligence for Systems and Machines, DISA 2018, Košice, Slovakia, ISBN 978-153865102-5, pp. 83-94.
Darjaa, S., Sabo, R., Trnka, M., Rusko, M., Múcsková, G. (2018) Automatic recognition of Slovak dialects. In: Proc. of IEEE World Symposium on Digital Intelligence for Systems and Machines, DISA 2018, Košice, Slovakia, ISBN 978-153865102-5, pp. 305-308.
Ondáš, S., Pleva, M., Krištan, R., Husovský, R., Juhár, J. (2018) VoMIS – The VoiceXML-based multimodal interactive system for NAO robot. In: Proc. of IEEE World Symposium on Digital Intelligence for Systems and Machines, DISA 2018, Košice, Slovakia, ISBN 978-153865102-5, pp. 315-320.
Koctúrová, M., Juhár, J. (2018) An overview of BCI-based speech recogntion methods. In: Proc. of IEEE World Symposium on Digital Intelligence for Systems and Machines, DISA 2018, Košice, Slovakia, ISBN 978-153865102-5, pp. 327-330.
Chivarov, N., Chikurtev, D., Pleva, M., Ondáš, S. (2018) Exploring human-robot interfaces for service mobile robots. In: Proc. of IEEE World Symposium on Digital Intelligence for Systems and Machines, DISA 2018, Košice, Slovakia, ISBN 978-153865102-5, pp. 337-342.
Koctúrová, M., Juhár, J. (2018) EEG based voice activity detection. In: Proc. of the 16th International Conference on Emerging eLearning Technologies and Applications, ICETA 2018, Starý Smokovec, Slovakia, ISBN 978-1-5386-7915-9, pp. 267-272.
Ondáš, S., Juhár, J. (2018) Analysis of turn-taking in the Slovak interview corpus. In: Proc. of the 16th International Conference on Emerging eLearning Technologies and Applications, ICETA 2018, Starý Smokovec, Slovakia, ISBN 978-1-5386-7915-9, pp. 411-416.
Hládek, D., Staš, J. (2018) Získavanie textových dát zo slovenského internetu. In: ARANEA 2018: Web Corpora as a Language Training Tool, Bratislava, Slovakia, ISBN 978-80-223-4597-2, pp. 31-39.
Ondáš, S., Juhár, J., Kiktová, E., Zimmermann, J. (2018) Anticipation in speech-based human-machine interfaces. In: Proc. of the 9th IEEE International Conference on Cognitive Infocommunications, CogInfoCom 2018, Budapest, Hungary, pp. 117-121.
Staš, J., Hládek, D., Lojka, M., Juhár, J. (2018) Dual-space re-ranking model for efficient document retrieval, user modeling and adaptation. In: Proc. of the 60th International Symposium Electronics in Marine, ELMAR 2018, Zadar, Croatia, ISBN 978-953-184-244-0, ISSN 1334-2630, pp. 203-206.
Pleva, M., Liao, Y.-F., Hsu, W., Hladek, D., Stas, J., Viszlay, P., Lojka, M., Juhar, J. (2018) Towards Slovak-English-Mandarin speech recognition using deep learning. In: Proc. of the 60th International Symposium Electronics in Marine, ELMAR 2018, Zadar, Croatia, ISBN 978-953-184-244-0, ISSN 1334-2630, pp. 151-154.
Rusko, M., Trnka, M., Darjaa, S., Stelkens-Kobsch, T., Finke, M. (2018) Weaknesses of voice biometrics – Sensitivity of speaker verification to emotional arousal. In: Proc. of the 25th International Congress on Sound and Vibration, Hiroshima, Japan, ISBN 978-151086845-8, pp. 3716-3723.
Liao, Y.-F., Pleva, M., Hladek, D., Stas, J., Viszlay, P., Lojka, M., Juhar, J. (2018) Gated module neural network for multilingual speech recognition. In: Proc. of the 11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018, Taipei, Taiwan.

Popularization

Viac informácií, vyššie vzdelanie, 25. jún 2016, Centrum účelových zariadení – stredisko SUZA, Drotárska cesta 46, 811 02 Bratislava, Slovenská republika. Účastníci: Ing. Ján Staš, PhD.
21. september 2016, Kasárne Kulturpark, Kukučínova 2, 040 01 Košice, Slovenská republika. Účastníci: Ing. Ján Staš, PhD.
H2020 ICT Proposers’ Day 2016, 26.-27. september 2016, Incheba EXPO Bratislava, Slovakia. Účastníci: Ing. Matúš Pleva, PhD., Ing. Dávid Čonka, Ing. Jozef Greššák
joined with GAMMA Workshop on Speech Processing in ATM Security, 28.-29. november 2016, Ústav informatiky, Slovenská akadémia vied, Dúbravská cesta 9, 845 07 Bratislava, Slovenská republika. Účastníci: prof. Ing. Jozef Juhár, CSc., Ing. Ján Staš, PhD., Ing. Peter Viszlay, PhD., Ing. Martin Lojka, PhD., Ing. Tomáš Koctúr
20. mája 2017, Slovenské technické múzeum, Košice, Slovensko. Účastníci: Ing. Stanislav Ondáš, PhD., Ing. Peter Viszlay, PhD., Ing. Martin Lojka, PhD.
May 31, 2017, St. Cyril and St. Methodius University of Veliko Turnovo, Slovakia. Participants: Ing. Ján Staš, PhD., Ing. Daniel Hládek, PhD.
Ing. Matúš Pleva, PhD. – Acoustic modelling for building secure speech enabled applications (invited speech), June 27, 2017, CAVS, MSU, Starkville, USA.
20. júla 2017, Technická univerzita v Košiciach, Košice, Slovensko. Účastníci: Ing. Stanislav Ondáš, PhD., Ing. Peter Viszlay, PhD.
Ing. Matúš Pleva, PhD. – Modern speech enabled HCI applications (plenary lecture), August 17, 2017, Brunel University London, London, United Kingdom.
20. septembra 2017, Štátne divadlo Košice – Malá scéna, Košice, Slovensko, Účastníci: Ing. Stanislav Ondáš, PhD.
October 09, 2017, Technical University of Košice, Košice, Slovakia. Participant: Ing. Matúš Pleva, PhD.
Mgr. Robert Sabo, PhD. – Hráme sa s hláskami (plenárna prednáška), 18. októbra 2017, Súkromná základná škola, Senec, Slovensko.
26. októbra 2017, Business Centrum T-2, Košice, Slovensko. Účastníci: Ing. Stanislav Ondáš, PhD., Ing. Peter Viszlay, PhD.
Mgr. Robert Sabo, PhD. – Ako počítače rozprávajú a píšu (plenárna prednáška), 26. októbra 2017, ÚI SAV, Bratislava, Slovensko.
November 09-10, 2017, HUNGEXPO Budapest Fair Center, Budapest, Hungary. Participant: Ing. Matúš Pleva, PhD.
12. novembra 2017, Ústav informatiky SAV, Bratislava, Slovensko. Účastníci: Mgr. Robert Sabo, PhD.
Ing. Marián Trnka – Vplyv hluku v kabíne na rozpoznávanie hovoriaceho (plenárna prednáška), 15. novembra 2017, Poráč Park, Poráč, Slovensko.
Ing. Matúš Pleva, PhD. – Voice based human-robot interaction applications (plenary lecture), November 24, 2017, Bulgarian Academy of Sciences, Sofia, Bulgaria.
Ing. Milan Rusko, PhD. – Oklamanie systému automatickej verifikácie hovoriaceho pomocou syntézy reči (plenárna prednáška), 12. decembra 2017, ÚI SAV, Bratislava, Slovensko. Účastníci: Ing. Marián Trnka, Mgr. Robert Sabo, PhD.
joined with 2nd GAMMA Workshop on Speech Processing in ATM Security, November 28-29, 2017, Institute of Informatics, SAS, Bratislava, Slovakia. Participants: prof. Ing. Jozef Juhár, CSc., Ing. Ján Staš, PhD., Ing. Martin Lojka, PhD., Ing. Tomáš Koctúr, Ing. Marianna Koctúrová, Ing. Jozef Greššák, Ing. Milan Rusko, PhD., Ing. Sakhia Darjaa, PhD., Ing. Marian Trnka, Mgr. Robert Sabo, PhD., Ing. Igor Guoth, PhD., doc. Mgr. Štefan Beňuš, PhD.
Reportážny príspevok s doc. Mgr. Štefanom Beňušom, PhD. v Magazíne o vede a technológiách (VaT) televízie RTVS, vysielané 24. februára 2018, natáčané na FIIT UK Bratislava, Slovensko
Ing. Matúš Pleva, PhD. - Building speech enabled human-computer interaction applications (plenary speech), March 21, 2018, University of Oulu, Finland
26. apríla 2018, Ústav informatiky SAV Bratislava, Slovensko. Účastník: Mgr. Róbert Sabo, PhD.
19. mája 2018, Slovenské technické múzeum, Košice, Slovensko. Účastníci: Ing. Stanislav Ondáš, PhD., Ing. Matúš Pleva, PhD., Ing. Tomáš Koctúr, Bc. Michal Krupa, Radovan Krištan, Maroš Lapčák
Ing. Stanislav Ondáš, PhD. a Ing. Matúš Pleva, PhD. - Najnovšie výsledky výskumu a inovacií v robotike a hlasovom rozpoznávaní (seminár), 27. júna 2018, Košice, Slovensko. Ďalší účastníci: prof. Ing. Jozef Juhár, CSc., Ing. Martin Lojka, PhD., Ing. Ján Staš, PhD., Ing. Peter Viszlay, PhD., Ing. Tomáš Koctúr, PhD.
10. a 17. júla 2018, Technická univerzita v Košiciach, Košice, Slovensko. Účastník: Ing. Stanislav Ondáš, PhD.
Ing. Stanislav Ondáš, PhD. a Ing. Martin Lojka, PhD. - Natural Language Processing at TU Košice (invited speech), September 06, 2018, Bratislava, Slovakia
Ing. Matúš Pleva, PhD. – Speech interface for building secure speech enabled applications (plenary speech), September 06, 2018, Dept. of Computer Science and Engineering, MSU, Starkville, USA
Rozhovor s doc. Mgr. Štefanom Beňušom, PhD. v časopise Téma s názvom „Fascinujúce tajomstvá reči“, uverejnené 07. septembra 2018
Ing. Matúš Pleva, PhD. - Implementation of a speech enabled virtual reality training (plenary speech), September 07, 2018, Center for Advanced Vehicular Systems, MSU, Starkville, USA
Rozhovor s Ing. Milanom Ruskom, PhD. na portáli Veda na dosah s názvom „Technika postupuje míľovými krokmi aj v automatickom spracovaní reči“, uverejnené 15. októbra 2018
Ing. Milan Rusko, PhD. – Ako ukradnúť hlas, ako odmerať emócie a iné tajomstvá ľudskej reči (pozvaná prednáška), 16. októbra 2018, CVTI Bratislava, Slovensko
Krátky rozhovor s Ing. Stanislavom Ondášom, PhD. o aktivitách KEMT FEI TU v Košiciach v relácii Vyznania Rádia Lumen, Rádio Lumen – Štúdio Košice, Slovensko, vysielané 21. októbra 2018
Krátky rozhovor s Ing. Milanom Ruskom, PhD. o aktivitách Oddelenia analýzy a syntézy reči ÚI SAV v Bratislave pre Rádio Regina, Slovenský rozhlas, Bratislava, Slovensko, nahávané 23. októbra 2018
Ing. Milan Rusko, PhD. - Počítačová analýza a syntéza ľudského hlasu a identifikácia emócií v reči (pozvaná prednáška), 23. októbra 2018, ÚI SAV v Bratislave, Slovensko
25. októbra 2018, Základné a stredné školy v Bratislavskom kraji. Účastník: Mgr. Róbert Sabo, PhD.
doc. Ing. Štefan Beňuš, PhD. a Ing. Marián Trnka - Rečové prispôsobovanie sa medzi človekom a robotom (seminár), 07. novembra 2018, FabLab CVTI Bratislava, Slovensko
Ing. Matúš Pleva, PhD. - Deep learning for advanced speech enabled applications (plenary speech), November 24., 2018, Dept. of Electronic Engineering, NTUT Taipei, Taiwan

Contact FEEI TUKE

Faculty of Electrical Engineering and Informatics
Technical University of Košice
Letná 9
042 00 Košice

prof. Ing. Jozef Juhár, CSc.
principal investigator
jozef.juhar(at)tuke.sk
P: +421 (0) 55 602 3208

Contact II SAS BA

Institute of Informatics
Slovak Academy of Science
Dúbravská cesta 9
845 07 Bratislava

Ing. Milan Rusko, PhD.
co-principal investigator
milan.rusko(at)savba.sk
P: +421 (0) 2 5941 1129