NotADirectoryError: [Errno 20] Not a directory: '../datasets/raw_data/ustc-tfc-2016/ustc-tfc-2016_detection_packet_test.json' #3

Open
ReamonYim opened this issue Sep 30, 2024 · 2 comments


@ReamonYim

Dear author,

I encountered an issue when running the command:

python preprocess_dataset.py --input /Your/Raw/Dataset/Path --dataset_name /Your/Raw/Dataset/Name --traffic_task detection --granularity packet-level --output_path /Your/Output/Dataset/Path --output_name /Your/Output/Dataset/Name

The error is:

NotADirectoryError: [Errno 20] Not a directory: '../datasets/raw_data/ustc-tfc-2016/ustc-tfc-2016_detection_packet_test.json'

I downloaded the ustc-tfc-2016 files from the training datasets link. Could you please confirm whether the files from this link are already preprocessed, or whether they are the raw files needed to run the script?

Thank you!

@CuiTianyu961030
Contributor

The training datasets are already preprocessed and can be used directly to train LLMs in steps 2.4 and 2.5. The preprocessing code only works for extracting training data from raw traffic (i.e., .pcap files). If you want to reproduce the extraction of training data from the raw USTC TFC 2016 dataset, please download the raw dataset using its released link.

I hope this reply can help you.
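
For anyone hitting the same error, a quick check before calling preprocess_dataset.py can tell raw captures apart from the already preprocessed JSON files. The sketch below is not part of the repository: find_pcap_files is a hypothetical helper, and it assumes the raw dataset is a directory (possibly with per-class subdirectories) of .pcap files. A NotADirectoryError like the one above typically appears when code tries to list a path that is a regular file, such as a .json file, as if it were a directory.

```python
from pathlib import Path


def find_pcap_files(input_path):
    """Return the sorted .pcap files under input_path.

    Hypothetical helper, not taken from the repository. It assumes raw
    traffic is stored as .pcap files somewhere below a directory.
    """
    root = Path(input_path)
    if not root.exists():
        raise FileNotFoundError(f"{root} does not exist")
    if not root.is_dir():
        # A preprocessed .json file passed as --input would end up here.
        raise NotADirectoryError(
            f"{root} is a file, not a directory of raw captures"
        )
    # Search recursively, since captures may sit in per-class subdirectories.
    pcaps = sorted(
        p for p in root.rglob("*")
        if p.is_file() and p.suffix.lower() == ".pcap"
    )
    if not pcaps:
        raise ValueError(
            f"no .pcap files under {root}; the data may already be preprocessed"
        )
    return pcaps
```

If this raises ValueError or NotADirectoryError for the downloaded files, they are most likely the preprocessed training data and the preprocessing step can be skipped.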

@ReamonYim
Author

Thank you very much.
