CVTStego-Net: a convolutional vision transformer architecture for spatial image steganalysis

The principal investigations in image steganalysis in the spatial domain have concentrated on convolutionalneural network (CNN) designs. However, existing CNNs increase the local receptive field of steganographicnoise without considering global steganographic noise. This study introduces CVTStego-Net, a convolutionalvision transformer for spatial domain image steganalysis that merges the strengths of convolutions and the advantages of attention mechanisms to capture both local and global dependencies. CVTStego-Net is composed of three stages: preprocessing stage, noise extraction, and analysis stage, and classification stage. The preprocessing stage involves a bifurcation with trainable and untrainable 30 SRM (Spatial Rich Models) filters to enhance steganographic noise. The noise extraction and analysis stage combines the SE-Block (Squeeze-and-Excitation) with residual operations to increase the sensitivity to steganographic noise and suppressing the influence of redundant information, and the classification stage combines SE-Block with a convolutional vision transformer to connect the local and global spatial relationships of the steganographic noise. This work enhanced the classification accuracies for steganographic algorithms compared to YEDROUDJ-Net, SR-Net,ZHU-Net, GBRAS-Net, and SNMC-Net. Specifically, the accuracy of CVTStego-Net for WOW at 0.2 bpp was 86.58%, and 0.4 bpp was 93.80%. Moreover, for S-UNIWARD at 0.2 and 0.4 bpp, the accuracies were 80.70% and 90.45%, respectively. For MiPOD at 0.2 and 0.4 bpp, the accuracies were 74.70% and 81.48%, respectively.For HILL at 0.2 and 0.4 bpp, the accuracies were 76.70% and 85.80%, respectively, and for HUGO at 0.2 and 0.4 bpp, the accuracies were 78.20% and 86.98%, respectively, using test data from the BOSSbase 1.01. The results demonstrate that convolutional vision transformers can classify steganographic images in the spatial domain.

Folders

transformer_1.py This file contains the model used in the various training experiments carried out for the research. To use it, the database must be varied and the hyperparameters adjusted to obtain the desired results, while keeping the same code.
SRM_Kernels1.npy This file contains the weights of the 30 SRM high-pass filters used for model training.

Requirements

This repository requires the following libraries and frameworks:

TensorFlow 2.10.0
scikit-learn
numPy
OpenCV
Matplotlib
os
scikit-image

This repository was developed in the Python3 (3.9.12) programming language.

Databases

The data set used to reproduce the results can be downloaded from: Here. Images taken from: BOSS competition, BOWS2 and ALASKA2 .

Authors

Universidad Autonoma de Manizales (https://www.autonoma.edu.co/)

Mario Alejandro Bravo-Ortiz
Esteban Mercado-Ruiz
Juan Pablo Villa-Pulgarin
Harold Brayan Arteaga-Arteaga
Reinel Tabares-Soto

References

[1]

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
README.md		README.md
SRM_Kernels1.npy		SRM_Kernels1.npy
transformer_1.py		transformer_1.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CVTStego-Net: a convolutional vision transformer architecture for spatial image steganalysis

Folders

Requirements

Databases

Authors

References

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

CVTStego-Net: a convolutional vision transformer architecture for spatial image steganalysis

Folders

Requirements

Databases

Authors

References

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages