LOCKEY: A Novel Approach to Model Authentication and Deepfake Tracking

Authors: Mayank Kumar Singh, Naoya Takahashi, Wei-Hsiang Liao, Yuki Mitsufuji

Published: 2024-09-12 04:28:22+00:00

Comment: Authenticating deep generative models, 5 pages, 5 figures, 2 tables

AI Summary

This paper introduces LOCKEY, a novel system for authenticating generative models and tracking users in white-box scenarios by integrating key-based authentication with watermarking. Each user receives the model parameters together with a unique key; a valid key yields the expected, watermarked output, while an invalid key yields degraded output, thereby enforcing authentication and embedding the user's ID for deepfake tracking. The approach is demonstrated on audio codecs and vocoders, and the embedded watermarks are shown to be robust against common distortions.

Abstract

This paper presents a novel approach to deter unauthorized deepfakes and enable user tracking in generative models, even when the user has full access to the model parameters, by integrating key-based model authentication with watermarking techniques. Our method involves providing users with model parameters accompanied by a unique, user-specific key. During inference, the model is conditioned on the key along with the standard input. A valid key results in the expected output, while an invalid key triggers a degraded output, thereby enforcing key-based model authentication. For user tracking, the model embeds the user's unique key as a watermark within the generated content, facilitating the identification of the user's ID. We demonstrate the effectiveness of our approach on two types of models, audio codecs and vocoders, utilizing the SilentCipher watermarking method. Additionally, we assess the robustness of the embedded watermarks against various distortions, validating their reliability in practical scenarios.


Key findings
Objective and subjective evaluations demonstrated that the proposed method effectively differentiates between valid and invalid keys, generating high-quality, watermarked content for valid keys and perceptually degraded output for invalid ones. The embedded watermarks showed robustness against common audio distortions, validating the system's reliability for deepfake tracking and user identification. The system maintains distinct performance for valid and invalid keys even as the total number of possible keys grows.
Approach
The method involves providing users with generative model parameters along with a unique, user-specific key. During inference, the model is conditioned on this key; a valid key leads to the expected output with the user's unique ID embedded as an imperceptible watermark, while an invalid key triggers a perceptually degraded output. This mechanism enforces key-based model authentication and facilitates user tracking.
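The gating-and-tracking logic described above can be sketched as plain control flow. This is a toy illustration only, not the paper's implementation: in LOCKEY the network itself is conditioned on the key and the ID is embedded imperceptibly with SilentCipher, whereas here `vocode`, `degrade`, the key registry, and the tuple-based "watermark" are all hypothetical placeholders.

```python
# Toy sketch of key-based model authentication plus user tracking.
# ISSUED_KEYS, vocode, degrade, and the watermark payload are placeholder
# assumptions; the real system conditions the generative model on the key
# and uses SilentCipher to embed the ID in the audio itself.

ISSUED_KEYS = {"user_042": "key-B2"}   # hypothetical user-to-key registry

def vocode(features):
    """Stand-in for the model's normal, high-quality synthesis path."""
    return [1.0 * x for x in features]

def degrade(features):
    """Stand-in for the perceptually degraded path an invalid key triggers."""
    return [0.1 * x for x in features]

def generate(features, user_id, key):
    """Valid key -> expected output carrying the user's ID as a watermark;
    invalid key -> degraded output and no watermark."""
    if ISSUED_KEYS.get(user_id) == key:
        return vocode(features), user_id   # (signal, watermark payload)
    return degrade(features), None

def identify_user(watermark_payload):
    """Tracking step: map a decoded watermark back to an issued user ID."""
    return watermark_payload if watermark_payload in ISSUED_KEYS else "unknown"

good, wm = generate([0.5, -0.5], "user_042", "key-B2")    # expected output
bad, no_wm = generate([0.5, -0.5], "user_042", "wrong")   # degraded output
```

The essential property the sketch captures is that authentication and tracking share one mechanism: the same key that unlocks expected-quality output is what later identifies the user from generated content.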
Datasets
VCTK, MTG-Jamendo
Model(s)
HiFi-GAN, EnCodec, SilentCipher
Author countries
Japan, UNKNOWN