Issues Regarding Dataset Preprocessing

Dear Author,

Thank you very much for sharing your excellent work. I truly appreciate your contribution and am very interested in exploring it further.

While going through the dataset preprocessing steps, I encountered some difficulties with a few of the datasets. Apart from the four datasets in CSV format that were already shared (arcene, isolet, micromass, and parkinson), I am having trouble preparing the CSV files for the others. I would greatly appreciate your guidance on the following:
- For the Lung and TOX-171 datasets, I noticed that the shared files are in .mat format. Should I convert these .mat files into CSV format for use in the code?

- For the BCTIL dataset, there are several files including cluster.txt, file_suplemental_info.txt, metadata.tsv, normalised_data_unimputed.tsv, trm_counts.tsv, trm_saver.tsv, and trm.h5seurat. Could you kindly let me know which of these should be used for preprocessing, or if any of them should be converted to CSV?

- Regarding the P53 dataset, after downloading from the provided link, I found two folders (p53_new_2012 and p53_old_2010), each containing files like K1-K9.def, K9.data, p53.names, and K8.instance.tags. I would be grateful if you could share which files should be used and how I might convert them to CSV format for use with your code.

- Lastly, for the HAR dataset, could you please let me know which specific file should be used for preprocessing?

Thank you again for your time and support. Any help would be greatly appreciated. I look forward to your guidance.




Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Issues Regarding Dataset Preprocessing #2

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issues Regarding Dataset Preprocessing #2

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions