Scrutinizing Implementations of Smart Home Integrations

Kulani Mahadewa, Kailong Wang, Guangdong Bai, Ling Shi, Yan Liu, Jin Song Dong, Zhenkai Liang
DOI: https://doi.org/10.1109/tse.2019.2960690
IF: 7.4
2021-12-01
IEEE Transactions on Software Engineering
Abstract: A key feature of the booming smart home is the integration of a wide assortment of technologies, including various standards, proprietary communication protocols and heterogeneous platforms. Due to customization, unsatisfied assumptions and incompatibility in the integration, critical security vulnerabilities are likely to be introduced by the integration. Hence, this work addresses the security problems in smart home systems from an integration perspective, as a complement to numerous studies that focus on the analysis of individual techniques. We propose HomeScan, an approach that examines the security of the implementations of smart home systems. It extracts the abstract specification of application-layer protocols and internal behaviors of entities, so that it is able to conduct an end-to-end security analysis against various attack models. Applying HomeScan on three extensively-used smart home systems, we have found twelve non-trivial security issues, which may lead to unauthorized remote control and credential leakage.
engineering, electrical & electronic; computer science, software engineering
What problem does this paper attempt to address?
The paper addresses security problems in smart home systems that arise from integration rather than from any single technology. A smart home combines various standards, proprietary communication protocols and heterogeneous platforms, and the authors argue that customization, unsatisfied assumptions and incompatibility among these components are likely to introduce critical security vulnerabilities. This integration perspective complements the many existing studies that analyze individual techniques in isolation. To address the problem, the authors propose HomeScan, an approach that examines the security of smart home system implementations. According to the abstract, the approach involves the following:
1. **Specification Extraction**: HomeScan extracts an abstract specification of the application-layer protocols and of the internal behaviors of the entities in the system.
2. **End-to-End Security Analysis**: Using the extracted specification, it analyzes the whole system end to end against various attack models.
3. **Evaluation**: Applied to three extensively used smart home systems, HomeScan found twelve non-trivial security issues, which may lead to unauthorized remote control and credential leakage.