Blind Speech Watermarking Method with Frame Self-Synchronization Based on Spread-Spectrum Using Linear Prediction Residue.

blind detection frame synchronization linear prediction liner prediction residue speech watermarking spread spectrum

Journal

Entropy (Basel, Switzerland)
ISSN: 1099-4300
Titre abrégé: Entropy (Basel)
Pays: Switzerland
ID NLM: 101243874

Informations de publication

Date de publication:
11 May 2022
Historique:
received: 19 03 2022
revised: 20 04 2022
accepted: 09 05 2022
entrez: 28 5 2022
pubmed: 29 5 2022
medline: 29 5 2022
Statut: epublish

Résumé

State-of-the-art speech watermarking techniques enable speech signals to be authenticated and protected against any malicious attack to ensure secure speech communication. In general, reliable speech watermarking methods must satisfy four requirements: inaudibility, robustness, blind-detectability, and confidentiality. We previously proposed a method of non-blind speech watermarking based on direct spread spectrum (DSS) using a linear prediction (LP) scheme to solve the first two issues (inaudibility and robustness) due to distortion by spread spectrum. This method not only effectively embeds watermarks with small distortion but also has the same robustness as the DSS method. There are, however, two remaining issues with blind-detectability and confidentiality. In this work, we attempt to resolve these issues by developing an approach called the LP-DSS scheme, which takes two forms of data embedding for blind detection and frame synchronization. We incorporate blind detection with frame synchronization into the scheme to satisfy blind-detectability and incorporate two forms of data embedding process, front-side and back-side embedding for blind detection and frame synchronization, to satisfy confidentiality. We evaluated these improved processes by carrying out four objective tests (PESQ, LSD, Bit-error-rate, and accuracy of frame synchronization) to determine whether inaudibility and blind-detectability could be satisfied. We also evaluated all combinations with the two forms of data embedding for blind detection with frame synchronization by carrying out BER tests to determine whether confidentiality could be satisfied. Finally, we comparatively evaluated the proposed method by carrying out ten robustness tests against various processing and attacks. Our findings showed that an inaudible, robust, blindly detectable, and confidential speech watermarking method based on the proposed LP-DSS scheme could be achieved.

Identifiants

pubmed: 35626561
pii: e24050677
doi: 10.3390/e24050677
pmc: PMC9141528
pii:
doi:

Types de publication

Journal Article

Langues

eng

Subventions

Organisme : I-O DATA foundation
ID : non

Références

Entropy (Basel). 2021 Sep 25;23(10):
pubmed: 34681970

Auteurs

Takuto Isoyama (T)

School of Information Science, Japan Advanced Institute of Science and Technology, Ishikawa 923-1292, Japan.

Shunsuke Kidani (S)

School of Information Science, Japan Advanced Institute of Science and Technology, Ishikawa 923-1292, Japan.

Masashi Unoki (M)

School of Information Science, Japan Advanced Institute of Science and Technology, Ishikawa 923-1292, Japan.

Classifications MeSH