Natural language watermarking via morphosyntactic alterations |
| |
Authors: | Hasan Mesut Meral, Bülent Sankur, A. Sumru Özsoy, Tunga Güngör, Emre Sevinç |
| |
Affiliation: | a Boğaziçi University, Linguistics Program, Bebek, İstanbul 34342, Turkey; b Boğaziçi University, Department of Electrical and Electronic Engineering, Bebek, İstanbul 34342, Turkey; c Boğaziçi University, Cognitive Science Program, Bebek, İstanbul 34342, Turkey; d Boğaziçi University, Department of Computer Engineering, Bebek, İstanbul 34342, Turkey |
| |
Abstract: | We develop a morphosyntax-based natural language watermarking scheme. In this scheme, a text is first transformed into a syntactic tree diagram in which the hierarchies and the functional dependencies are made explicit. The watermarking software then operates on the sentences in syntax tree format and executes binary changes under the control of WordNet and a dictionary to avoid semantic drops. A certain level of security is provided via key-controlled randomization of the morphosyntactic tools and the insertion of void watermarks. The security and payload aspects are evaluated statistically, while imperceptibility is measured using edit-hit counts based on human judgments. We observe that agglutinative languages are somewhat more amenable to morphosyntax-based natural language watermarking, and that the free word order property of a language such as Turkish is an extra bonus. |
| |
Keywords: | Natural language watermarking; Tree bank; Agglutinative; Morphosyntax; Text payload |
This article is indexed in ScienceDirect and other databases. |
|