Input of CNN_LSTM model

Hi Nika,

I am also interested in using CNN-LSTM architecture to detect deepfake videos. Thank you for your awesome work. I get a lot of benefits from it. Here I would like to ask you a question about training CNN_LSTM model.

In the forward function of class [CNN_LSTM](https://github.com/Megatvini/DeepFaceForgeryDetection/blob/fd111d0125c63504d3ef6bba08ad5a782a843db6/src/model.py#L53), the shape of the input images returns five variables, which is different from the input of class FaceRecognitionCNN. What is the difference between 'num_channels' and 'depth'? It seems we have to change the function of dataloader to fit the input of CNN_LSTM, otherwise, it raises errors with not enough values to unpack.

Could you explain what are the variables of 'num_channels' and 'depth' means in this line? How should I change the dataloader to pack the train data to fit the dimension of the input of CNN_LSTM?

I am looking forward to your answer. Thank you very much.

Best

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Input of CNN_LSTM model #11

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Input of CNN_LSTM model #11

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions