Abstract:Dynamic Time Warping (DTW) is an algorithm to align temporal sequences with possible local non-linear distortions, and has been widely applied to audio, video and graphics data alignments. DTW is essentially a point-to-point matching method under some boundary and temporal consistency constraints. Although DTW obtains a global optimal solution, it does not necessarily achieve locally sensible matchings. Concretely, two temporal points with entirely dissimilar local structures may be matched by DTW. To address this problem, we propose an improved alignment algorithm, named shape Dynamic Time Warping (shapeDTW), which enhances DTW by taking point-wise local structural information into consideration. shapeDTW is inherently a DTW algorithm, but additionally attempts to pair locally similar structures and to avoid matching points with distinct neighborhood structures. We apply shapeDTW to align audio signal pairs having ground-truth alignments, as well as artificially simulated pairs of aligned sequences, and obtain quantitatively much lower alignment errors than DTW and its two variants. When shapeDTW is used as a distance measure in a nearest neighbor classifier (NN-shapeDTW) to classify time series, it beats DTW on 64 out of 84 UCR time series datasets, with significantly improved classification accuracies. By using a properly designed local structure descriptor, shapeDTW improves accuracies by more than 10% on 18 datasets. To the best of our knowledge, shapeDTW is the first distance measure under the nearest neighbor classifier scheme to significantly outperform DTW, which had been widely recognized as the best distance measure to date. Our code is publicly accessible at: <a class="link-external link-https" href="https://github.com/jiapingz/shapeDTW" rel="external noopener nofollow">this https URL</a>.

Speech recognition using Dynamic Time Warping (DTW)

Voice Recognition Algorithms using Mel Frequency Cepstral Coefficient (MFCC) and Dynamic Time Warping (DTW) Techniques

Application of dynamic time warping optimization algorithm in speech recognition of machine translation

Speech Recognition Implementation Using MFCC and DTW Algorithm for Home Automation

Implementation of Abnormal Sound Detection in Intelligent Surveillance Front-end System

Research on Speaker-Depended Isolated-Word Speech Recognition System

One-against-all weighted dynamic time warping for language-independent and speaker-dependent speech recognition in adverse conditions

Speaker-Independent English Consonant and Japanese Word Recognition by a Stochastic Dynamic Time Warping Method

Dynamic time warping in phoneme modeling for fast pronunciation error detection

Study of Various Machine Learning Algorithms for use with Automatic Speech Recognition

An Efficient Framework of Human Voice Verification for Robotic Applications

A Normalized Least Mean Square and Dynamic Time Warping (DTW) Algorithm for an Intelligent Quran Tutoring System

EventDTW: An Improved Dynamic Time Warping Algorithm for Aligning Biomedical Signals of Nonuniform Sampling Frequencies

Wavoice: an Mmwave-Assisted Noise-Resistant Speech Recognition System.

Wavoice: an Mmwave-Assisted Noise-Resistant Speech Recognition System

Wavoice: A mmWave-assisted Noise-resistant Speech Recognition SystemJust Accepted

Recognition of score words in freestyle kayaking using improved DTW matching

A Permutation Algorithm Based on Dynamic Time Warping in Speech Frequency-Domain Blind Source Separation

Learning Discriminative Prototypes with Dynamic Time Warping

Improvement and Application of Hale's Dynamic Time Warping Algorithm

shapeDTW: shape Dynamic Time Warping