Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CLAP_freesound broken #99

Open
cyrusvahidi opened this issue Nov 27, 2023 · 0 comments
Open

CLAP_freesound broken #99

cyrusvahidi opened this issue Nov 27, 2023 · 0 comments

Comments

@cyrusvahidi
Copy link

cyrusvahidi commented Nov 27, 2023

I am trying to use CLAP_freesound from huggingface datasets. I get the following error when trying to load the dataset with datasets:

(ssl) bash-4.2$ python
Python 3.10.7 (main, Nov  2 2022, 14:46:09) [GCC 12.1.0] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> from datasets import load_dataset
>>> load_dataset("Meranti/CLAP_freesound")
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/data/home/acw532/venvs/ssl/lib/python3.10/site-packages/datasets/load.py", line 2128, in load_dataset
    builder_instance = load_dataset_builder(
  File "/data/home/acw532/venvs/ssl/lib/python3.10/site-packages/datasets/load.py", line 1814, in load_dataset_builder
    dataset_module = dataset_module_factory(
  File "/data/home/acw532/venvs/ssl/lib/python3.10/site-packages/datasets/load.py", line 1511, in dataset_module_factory
    raise e1 from None
  File "/data/home/acw532/venvs/ssl/lib/python3.10/site-packages/datasets/load.py", line 1495, in dataset_module_factory
    ).get_module()
  File "/data/home/acw532/venvs/ssl/lib/python3.10/site-packages/datasets/load.py", line 1053, in get_module
    module_name, default_builder_kwargs = infer_module_for_data_files(
  File "/data/home/acw532/venvs/ssl/lib/python3.10/site-packages/datasets/load.py", line 512, in infer_module_for_data_files
    raise ValueError(f"Couldn't infer the same data file format for all splits. Got {split_modules}")
ValueError: Couldn't infer the same data file format for all splits. Got {NamedSplit('train'): (None, {}), NamedSplit('test'): ('json', {})}

I am finding it difficult to get started with LAION-audio-630k and aggregate all of the datasets.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant