dc.contributor.author | Sabzi Shahrebabaki, Abdolreza | |
dc.contributor.author | Olfati, Negar | |
dc.contributor.author | Imran, Ali Shariq | |
dc.contributor.author | Johnsen, Magne Hallstein | |
dc.contributor.author | Siniscalchi, Sabato Marco | |
dc.contributor.author | Svendsen, Torbjørn Karl | |
dc.date.accessioned | 2023-10-02T11:57:30Z | |
dc.date.available | 2023-10-02T11:57:30Z | |
dc.date.issued | 2021-05-13 | |
dc.description.abstract | This paper proposes a two-stage deep feed-forward neural network (DNN) to tackle the acoustic-to-articulatory inversion (AAI) problem. DNNs are a viable solution for the AAI task, but the temporal continuity of the estimated articulatory values has not been exploited properly when a DNN is employed. In this work, we propose to address the lack of any temporal constraints while enforcing a parameter-parsimonious solution by deploying a two-stage solution based only on DNNs: (i) Articulatory trajectories are estimated in a first stage using DNN, and (ii) a temporal window of the estimated trajectories is used in a follow-up DNN stage as a refinement. The first stage estimation could be thought of as an auxiliary additional information that poses some constraints on the inversion process. Experimental evidence demonstrates an average error reduction of 7.51% in terms of RMSE compared to the baseline, and an improvement of 2.39% with respect to Pearson correlation is also attained. Finally, we should point out that AAI is still a highly challenging problem, mainly due to the non-linearity of the acoustic-to-articulatory and one-to-many mapping. It is thus promising that a significant improvement was attained with our simple yet elegant solution. | en_US |
dc.identifier.citation | Sabzi Shahrebabaki, Olfati, Imran, Johnsen, Siniscalchi, Svendsen: A Two-Stage Deep Modeling Approach to Articulatory Inversion. In: Androutsos, Plataniotis K, Zhang X. ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
, 2021. IEEE | en_US |
dc.identifier.cristinID | FRIDAID 2009124 | |
dc.identifier.doi | 10.1109/ICASSP39728.2021.9413742 | |
dc.identifier.isbn | 978-1-7281-7606-2 | |
dc.identifier.issn | 1520-6149 | |
dc.identifier.issn | 2379-190X | |
dc.identifier.uri | https://hdl.handle.net/10037/31359 | |
dc.language.iso | eng | en_US |
dc.publisher | IEEE | en_US |
dc.rights.accessRights | openAccess | en_US |
dc.rights.holder | Copyright 2021 The Author(s) | en_US |
dc.title | A Two-Stage Deep Modeling Approach to Articulatory Inversion | en_US |
dc.type.version | acceptedVersion | en_US |
dc.type | Chapter | en_US |
dc.type | Bokkapittel | en_US |