ČERVA, P., NOUZA, J., ŽĎÁNSKÝ, J. , and MATĚJŮ, L. Softwarové moduly pro automatický přepis a zpracování mluvené dánštiny [software].
ČERVA, P., NOUZA, J., ŽĎÁNSKÝ, J. , and MATĚJŮ, L. Softwarové moduly pro automatický přepis a zpracování mluvené norštiny [software].
ČERVA, P., NOUZA, J., ŽĎÁNSKÝ, J. , and MATĚJŮ, L. Softwarové moduly pro automatický přepis a zpracování mluvené švédštiny [software].
MATĚJŮ, L. et al. Combining Multilingual Resources and Models to Develop State-of-the-Art E2E ASR for Swedish Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH Dublin: ISCA, 2023 P. 3252 – 3256. ISSN: 2308-457X.
NOUZA, J., MATĚJŮ, L., ČERVA, P. , and ŽĎÁNSKÝ, J. Developing State-of-the-Art End-to-End ASR for Norwegian Lecture Notes in Computer Science - including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics Springer Science and Business, 2023 P. 200 – 213. ISBN: 978-303140497-9, ISSN: 03029743.
KYNYCH, F., ŽĎÁNSKÝ, J., ČERVA, P. , and MATĚJŮ, L. Online Speaker Diarization Using Optimized SE-ResNet Architecture Lecture Notes in Computer Science Německo: Springer, 2023 P. 176 – 187. ISBN: 978-303140497-9, ISSN: 03029743.
MATĚJŮ, L. et al. Overlapped Speech Detection in Broadcast Streams Using X-vectors Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH Jižní Korea: ISCA, 2022 P. 4606 – 4610. ISSN: 2308-457X.
MATĚJŮ, L. et al. An Empirical Assessment of Deep Learning Approaches to Task-Oriented Dialog Management Neurocomputing Elsevier, 2021, vol. 439, issue 7. P. 327 – 339. ISSN: 0925-2312.
ČERVA, P. et al. Identification of related languages from spoken data: Moving from off-line to on-line scenario Computer Speech and Language Elsevier, 2021, vol. 68, issue JUL. P. neuvedeny (19 stran). ISSN: 0885-2308.
ČERVA, P. et al. Identification of Scandinavian Languages from Speech Using Bottleneck Features and X-vectors Lecture Notes in Computer Science Switzerland: Springer Nature Switzerland AG, 2021 P. 371 – 381. ISBN: 978-303083526-2, ISSN: 0302-9743.
MATĚJŮ, L. et al. Using X-vectors for Speech Activity Detection in Broadcast Streams Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH ISCA, 2021 P. 4161 – 4165. ISBN: 978-171383690-2, ISSN: 2308-457X.
MATĚJŮ, L., ČERVA, P. , and ŽĎÁNSKÝ, J. An Approach to Online Speaker Change Point Detection Using DNNs and WFSTs Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 1. ed. Austria: ISCA, 2019 P. 649 – 653. ISSN: 2308-457X.
ŠAFAŘÍK, R. , and MATĚJŮ, L. Automatic Development of ASR System for an Under-Resourced Language 41st International Conference on Telecommunications and Signal Processing (TSP) Řecko: IEEE, 2018 P. 100 – 103. ISBN: 978-153864695-3.
ŠAFAŘÍK, R., MATĚJŮ, L. , and WEINGARTOVÁ, L. The Influence of Errors in Phonetic Annotations on Performance of Speech Recognition System Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) - 21st International Conference on Text, Speech, and Dialogue, TSD 2018 1. ed. Springer Verlag, 2018 P. 419 – 427. ISBN: 978-303000793-5, ISSN: 0302-9743.
MATĚJŮ, L., ČERVA, P., ŽĎÁNSKÝ, J. , and ŠAFAŘÍK, R. Using Deep Neural Networks for Identification of Slavic Languages from Acoustic Signal Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 1. ed. Indie: ISCA, 2018 P. 1803 – 1807. ISSN: 2308-457X.
MATĚJŮ, L., ČERVA, P. , and ŽĎÁNSKÝ, J. Investigation into the Use of WFSTs and DNNs for Speech Activity Detection in Broadcast Data Transcription Communications in Computer and Information Science Spolková republika Německo: Springer Verlag, 2017 P. 341 – 358. ISBN: 978-331967875-7, ISSN: 1865-0929.
MATĚJŮ, L., ČERVA, P., ŽĎÁNSKÝ, J. , and MÁLEK, J. Speech Activity Detection in Online Broadcast Transcription Using Deep Neural Networks and Weighted Finite State Transducers 2017 IEEE IICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedingsnternational Conference on Acoustics, Speech, and Signal Processing, ICASSP 2017 USA: Institute of Electrical and Electronics Engineers Inc., 2017 P. 5460 – 5464. ISBN: 978-1-5090-4117-6, ISSN: 1520-6149.
ŠAFAŘÍK, R. , and MATĚJŮ, L. The Impact of Inaccurate Phonetic Annotations on Speech Recognition Performance Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Německo: Springer Verlag, 2017 P. 402 – 410. ISBN: 9783319642055, ISSN: 0302-9743.
BOHÁČ, M., MATĚJŮ, L., ROTT, M. , and ŠAFAŘÍK, R. Automatic Syllabification and Syllable Timing of Automatically Recognized Speech - for Czech Proc. of the 19th International Conference of Text, Speech, and Dialogue - TSD 2016 Switzerland: Springer International Publishing, 2016 P. 540 – 547. ISBN: 978-3-319-45509-9, ISSN: 0302-9743.
ŠAFAŘÍK, R. , and MATĚJŮ, L. Impact of phonetic annotation precision on automatic speech recognition systems 2016 39th International Conference on Telecommunications and Signal Processing, TSP 2016 Rakousko, Vídeň: Institute of Electrical and Electronics Engineers Inc., 2016 P. 311 – 314. ISBN: 978-1-5090-1287-9, ISSN: 1805-5435.
MATĚJŮ, L., ČERVA, P. , and ŽĎÁNSKÝ, J. Study on the use of deep neural networks for speech activity detection in broadcast recordings ICETE 2016 - Proceedings of the 13th International Joint Conference on e-Business and Telecommunications Lisabon, Portugalsko: SciTePress, 2016 P. 45 – 51. ISBN: 978-989-758-196-0.
MATĚJŮ, L., ČERVA, P. , and ŽĎÁNSKÝ, J. Investigation into the use of deep neural networks for LVCSR of Czech 2015 IEEE International Workshop of Electronics, Control, Measurement, Signals and their application to Mechatronics 1. ed. Česká Republika: IEEE, 2015 P. 38 – 41. ISBN: 978-1-4799-6972-2.
Topics of Student work
Fearless Steps Challenge: Detekce řeči v audio nahrávkách z NASA programu Apollo, ITE, 2024
Data collection and analysis for E2E speech recognition - Vietnamese, ITE, 2024
Identification of Slavic languages from audio recordings on the VoxLingua107 dataset, ITE, 2024