KYNYCH, F. et al. A lightweight approach to real-time speaker diarization: from audio toward audio-visual data streams EURASIP Journal on Audio, Speech, and Music Processing Springer, 2024, vol. 2024, issue 1. P. not specified (16 pages). ISSN: 1687-4722.
ČERVA, P., NOUZA, J., ŽĎÁNSKÝ, J., and MATĚJŮ, L. Software modules for automatic transcription and processing of spoken Danish [software].
ČERVA, P., NOUZA, J., ŽĎÁNSKÝ, J., and MATĚJŮ, L. Software modules for automatic transcription and processing of spoken Norwegian [software].
ČERVA, P., NOUZA, J., ŽĎÁNSKÝ, J., and MATĚJŮ, L. Software modules for automatic transcription and processing of spoken Swedish [software].
KYNYCH, F., ČERVA, P., and ŽĎÁNSKÝ, J. System for online speaker diarization in audio-visual data streams [software].
MATĚJŮ, L. et al. Combining Multilingual Resources and Models to Develop State-of-the-Art E2E ASR for Swedish Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH Dublin: ISCA, 2023 P. 3252 – 3256. ISSN: 2308-457X.
NOUZA, J., MATĚJŮ, L., ČERVA, P., and ŽĎÁNSKÝ, J. Developing State-of-the-Art End-to-End ASR for Norwegian Lecture Notes in Computer Science - including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics Springer Science and Business, 2023 P. 200 – 213. ISBN: 978-303140497-9, ISSN: 0302-9743.
POLÁČEK, M., ČERVA, P., ŽĎÁNSKÝ, J., and WEINGARTOVÁ, L. Online Punctuation Restoration using ELECTRA Model for streaming ASR Systems Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH Ireland: ISCA, 2023 P. 446 – 450. ISSN: 2308-457X.
KYNYCH, F., ŽĎÁNSKÝ, J., ČERVA, P., and MATĚJŮ, L. Online Speaker Diarization Using Optimized SE-ResNet Architecture Lecture Notes in Computer Science Germany: Springer, 2023 P. 176 – 187. ISBN: 978-303140497-9, ISSN: 0302-9743.
NOUZA, J., ČERVA, P., and ŽĎÁNSKÝ, J. Lexicon-based vs. Lexicon-free ASR for Norwegian Parliament Speech Transcription Lecture Notes in Computer Science SPRINGER-VERLAG BERLIN, 2022 P. 401 – 409. ISBN: 978-303116269-5, ISSN: 0302-9743.
MATĚJŮ, L. et al. Overlapped Speech Detection in Broadcast Streams Using X-vectors Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH South Korea: ISCA, 2022 P. 4606 – 4610. ISSN: 2308-457X.
MÁLEK, J. et al. Target Speech Extraction: Independent Vector Extraction Guided by Supervised Speaker Identification IEEE/ACM Transactions on Audio, Speech, and Language Processing IEEE, 2022, vol. 30, issue 30. P. 2295 – 2309. ISSN: 2329-9290.
MÁLEK, J. et al. Blind extraction of moving audio source in a challenging environment supported by speaker identification via X-vectors ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings 1. ed. USA: IEEE, 2021 P. 226 – 230. ISSN: 1520-6149.
ČERVA, P. et al. Identification of related languages from spoken data: Moving from off-line to on-line scenario Computer Speech and Language Elsevier, 2021, vol. 68, issue JUL. P. not specified (19 pages). ISSN: 0885-2308.
ČERVA, P. et al. Identification of Scandinavian Languages from Speech Using Bottleneck Features and X-vectors Lecture Notes in Computer Science Switzerland: Springer Nature Switzerland AG, 2021 P. 371 – 381. ISBN: 978-303083526-2, ISSN: 0302-9743.
MATĚJŮ, L. et al. Using X-vectors for Speech Activity Detection in Broadcast Streams Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH ISCA, 2021 P. 4161 – 4165. ISBN: 978-171383690-2, ISSN: 2308-457X.
JANSKÝ, J. et al. Adaptive blind audio source extraction supervised by dominant speaker identification using x-vectors ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings 1. ed. Barcelona: IEEE, 2020 P. 676 – 680. ISBN: 978-1-5090-6631-5, ISSN: 1520-6149.
CHALOUPKA, J., PALEČEK, K., ČERVA, P., and ŽĎÁNSKÝ, J. Optical Character Recognition for Audio-Visual Broadcast Transcription System 11th IEEE International Conference on Cognitive Infocommunications, CogInfoCom 2020 - Proceedings 1. ed. Finland: IEEE, 2020 P. 229 – 232. ISBN: 978-172818213-1.
NOUZA, J., ČERVA, P., and ŽĎÁNSKÝ, J. Very Fast Keyword Spotting System with Real Time Factor below 0.01 Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) - 23rd International Conference on Text, Speech, and Dialogue, TSD 2020 1. ed. Switzerland: Springer Nature Switzerland, 2020 P. 426 – 436. ISBN: 978-303058322-4, ISSN: 0302-9743.
MÁLEK, J., and ŽĎÁNSKÝ, J. Voice-activity and overlapped speech detection using x-vectors Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) - 23rd International Conference on Text, Speech, and Dialogue, TSD 2020 1. ed. Switzerland: Springer Nature Switzerland, 2020 P. 366 – 376. ISBN: 978-303058322-4, ISSN: 0302-9743.
MATĚJŮ, L., ČERVA, P., and ŽĎÁNSKÝ, J. An Approach to Online Speaker Change Point Detection Using DNNs and WFSTs Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 1. ed. Austria: ISCA, 2019 P. 649 – 653. ISSN: 2308-457X.
MÁLEK, J., and ŽĎÁNSKÝ, J. On Practical Aspects of Multi-condition Training Based on Augmentation for Reverberation-/Noise-Robust Speech Recognition Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 1. ed. Switzerland: Springer Nature Switzerland AG, 2019 P. 251 – 263. ISBN: 978-303027946-2, ISSN: 0302-9743.
MÁLEK, J., ŽĎÁNSKÝ, J., and ČERVA, P. Robust Recognition of Conversational Telephone Speech via Multi-Condition Training and Data Augmentation Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) - 21st International Conference on Text, Speech, and Dialogue, TSD 2018 Springer Verlag, 2018 P. 324 – 333. ISBN: 978-303000793-5, ISSN: 0302-9743.
MÁLEK, J., ŽĎÁNSKÝ, J., and ČERVA, P. Robust Recognition of Speech with Background Music in Acoustically Under-Resourced Scenarios ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings 1. ed. Canada: IEEE, 2018 P. 5624 – 5628. ISBN: 978-153864658-8, ISSN: 1520-6149.
MATĚJŮ, L., ČERVA, P., ŽĎÁNSKÝ, J., and ŠAFAŘÍK, R. Using Deep Neural Networks for Identification of Slavic Languages from Acoustic Signal Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 1. ed. India: ISCA, 2018 P. 1803 – 1807. ISSN: 2308-457X.
MATĚJŮ, L., ČERVA, P., and ŽĎÁNSKÝ, J. Investigation into the Use of WFSTs and DNNs for Speech Activity Detection in Broadcast Data Transcription Communications in Computer and Information Science Germany: Springer Verlag, 2017 P. 341 – 358. ISBN: 978-331967875-7, ISSN: 1865-0929.
NOUZA, J. et al. Multilingual platform for monitoring and analysis of multimedia [verified technology]. 2017.
MÁLEK, J., ŽĎÁNSKÝ, J., and ČERVA, P. Robust Automatic Recognition of Speech with Background Music 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2017, New Orleans, 5 – 9 March 2017 USA: Institute of Electrical and Electronics Engineers Inc., 2017 P. 5210 – 5214. Article number 7953150. ISBN: 978-1-5090-4117-6, ISSN: 1520-6149.
MATĚJŮ, L., ČERVA, P., ŽĎÁNSKÝ, J., and MÁLEK, J. Speech Activity Detection in Online Broadcast Transcription Using Deep Neural Networks and Weighted Finite State Transducers ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings (2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2017) USA: Institute of Electrical and Electronics Engineers Inc., 2017 P. 5460 – 5464. ISBN: 978-1-5090-4117-6, ISSN: 1520-6149.
MATĚJŮ, L., ČERVA, P., and ŽĎÁNSKÝ, J. Study on the use of deep neural networks for speech activity detection in broadcast recordings ICETE 2016 - Proceedings of the 13th International Joint Conference on e-Business and Telecommunications Lisbon, Portugal: SciTePress, 2016 P. 45 – 51. ISBN: 978-989-758-196-0.
MÁLEK, J. et al. Compensation of Nonlinear Distortions in Speech for Automatic Recognition 38th International Conference on Telecommunications and Signal Processing, TSP 2015 1. ed. Prague, Czech Republic: Institute of Electrical and Electronics Engineers Inc., 2015 P. 419 – 423. ISBN: 978-1-4799-8498-5.
MATĚJŮ, L., ČERVA, P., and ŽĎÁNSKÝ, J. Investigation into the use of deep neural networks for LVCSR of Czech 2015 IEEE International Workshop of Electronics, Control, Measurement, Signals and their application to Mechatronics 1. ed. Czech Republic: IEEE, 2015 P. 38 – 41. ISBN: 978-1-4799-6972-2.
NOUZA, J. et al. Unique software technology platform for transcribing archives of historical and contemporary Czech Radio (ČRo) programmes and making them accessible via the web [software].