Generalizing Face Forgery Detection with High-frequency Features 论文阅读

date

Jan 6, 2023

Last edited time

Mar 27, 2023 08:38 AM

status

Published

slug

Generalizing_Face_Forgery_Detection_with_High-frequency_Features论文阅读

tags

summary

type

Post

origin

https://www.notion.so/lazurite/Generalizing-Face-Forgery-Detection-with-High-frequency-Features-c3df4d6f6a854edca61f6e27e2102644

Field

Plat

Generalizing Face Forgery Detection with High-frequency Features

Current face forgery detection methods achieve high accuracy under the within-database scenario where training and testing forgeries are synthesized by the same algorithm. However, few of them gain satisfying performance under the cross-database scenario where training and testing forgeries are synthesized by different algorithms. In this paper, we find that current CNN-based detectors tend to overfit to method-specific color textures and thus fail to generalize.

https://ieeexplore.ieee.org/document/9578868/

Luo 等 - 2021 - Generalizing Face Forgery Detection with High-freq.pdf

2218.1KB

Abstract Analysis Why current methods fail to generalize?What is common in forged face images?SRM Noise Method Multi-scale High-frequency Feature Extraction Residual Guided Spatial Attention Dual Cross-modality Attention Experiments Result Ablation Study

Abstract

Problem

当前的人脸伪造检测方法在训练和测试伪造由相同算法合成的数据库内场景下实现了高精度。然而，在训练和测试伪造由不同算法合成的跨数据库场景下，很少能够获得令人满意的性能。

人脸伪造检测的泛化性问题是由于不同操作技术产生的数据分布多样化，具有高数据库内检测精度的方法在跨数据库场景中总是会出现严重的性能下降，从而限制了更广泛的应用。

Analysis

我们发现当前基于 CNN 的检测器倾向于过度拟合特定于方法的颜色纹理，因此无法泛化。观察到高频噪声可以抑制图像纹理并暴露篡改区域和真实区域之间的统计差异，我们建议利用噪声来解决过度拟合问题。

Method

首先是多尺度高频特征提取模块。我们采用 SRM 中广泛使用的高通滤波器来从图像中提取高频噪声。

同时使用高频噪声和低频纹理(RGB)，我们构建了一个双流网络来处理两种模式。

应用 residual guided spatial attention，引导 RGB 模态更加重视伪造痕迹。

设计了一个双重跨模式注意模块来制定两种模式之间的交互，而不是让它们保持独立。

Analysis

Why current methods fail to generalize?

现有的模型泛化能力差的原因是那些深度 CNN 模型学会了捕捉方法特定的纹理模式以进行伪造检测。Geirhos 等人研究了 CNN 的纹理响应，表明 CNN 模型强烈偏向于纹理。不同的伪造算法总是具有独特的网络架构和处理流，因此不同算法处理的图像将具有不同的伪造纹理。因此，已经偏向于一种假纹理的 CNN 模型很难泛化到另一种。