Dataset Preview
The full dataset viewer is not available (click to read why). Only showing a preview of the rows.
The dataset generation failed
Error code: DatasetGenerationError
Exception: TypeError
Message: int() argument must be a string, a bytes-like object or a real number, not 'NoneType'
Traceback: Traceback (most recent call last):
File "/usr/local/lib/python3.14/site-packages/datasets/builder.py", line 1520, in _prepare_split_single
for key, record in generator:
^^^^^^^^^
File "/src/services/worker/src/worker/job_runners/config/parquet_and_info.py", line 613, in wrapped
for item in generator(*args, **kwargs):
~~~~~~~~~^^^^^^^^^^^^^^^^^
File "/usr/local/lib/python3.14/site-packages/datasets/packaged_modules/webdataset/webdataset.py", line 130, in _generate_examples
for example_idx, example in enumerate(self._get_pipeline_from_tar(tar_path, tar_iterator)):
~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/usr/local/lib/python3.14/site-packages/datasets/packaged_modules/webdataset/webdataset.py", line 34, in _get_pipeline_from_tar
for filename, f in tar_iterator:
^^^^^^^^^^^^
File "/usr/local/lib/python3.14/site-packages/datasets/utils/track.py", line 49, in __iter__
for x in self.generator(*self.args):
~~~~~~~~~~~~~~^^^^^^^^^^^^
File "/usr/local/lib/python3.14/site-packages/datasets/utils/file_utils.py", line 1405, in _iter_from_urlpath
with xopen(urlpath, "rb", download_config=download_config, block_size=0) as f:
~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/usr/local/lib/python3.14/site-packages/datasets/utils/file_utils.py", line 982, in xopen
file_obj = fs.open(paths[0], mode)
File "<string>", line 3, in open
File "/usr/local/lib/python3.14/unittest/mock.py", line 1176, in __call__
return self._mock_call(*args, **kwargs)
~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
File "/usr/local/lib/python3.14/unittest/mock.py", line 1180, in _mock_call
return self._execute_mock_call(*args, **kwargs)
~~~~~~~~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
File "/usr/local/lib/python3.14/unittest/mock.py", line 1247, in _execute_mock_call
result = effect(*args, **kwargs)
File "/src/services/worker/src/worker/job_runners/config/parquet_and_info.py", line 786, in wrapped
tracker.files[urlpath] = {"read": 0, "size": int(f.size)}
~~~^^^^^^^^
TypeError: int() argument must be a string, a bytes-like object or a real number, not 'NoneType'
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
File "/src/services/worker/src/worker/job_runners/config/parquet_and_info.py", line 1369, in compute_config_parquet_and_info_response
parquet_operations, partial, estimated_dataset_info = stream_convert_to_parquet(
~~~~~~~~~~~~~~~~~~~~~~~~~^
builder, max_dataset_size_bytes=max_dataset_size_bytes
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
)
^
File "/src/services/worker/src/worker/job_runners/config/parquet_and_info.py", line 948, in stream_convert_to_parquet
builder._prepare_split(split_generator=splits_generators[split], file_format="parquet")
~~~~~~~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/usr/local/lib/python3.14/site-packages/datasets/builder.py", line 1382, in _prepare_split
for job_id, done, content in self._prepare_split_single(
~~~~~~~~~~~~~~~~~~~~~~~~~~^
gen_kwargs=gen_kwargs, job_id=job_id, **_prepare_split_args
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
):
^
File "/usr/local/lib/python3.14/site-packages/datasets/builder.py", line 1560, in _prepare_split_single
raise DatasetGenerationError("An error occurred while generating the dataset") from e
datasets.exceptions.DatasetGenerationError: An error occurred while generating the datasetNeed help to make the dataset viewer work? Make sure to review how to configure the dataset viewer, and open a discussion for direct support.
jpg image | __key__ string | __url__ string |
|---|---|---|
recs_zero250m_batch_00_data_00000_rel_00003587_abs_000003587 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00000510_abs_000000510 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00000049_abs_000000049 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00000445_abs_000000445 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00005198_abs_000005198 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00004580_abs_000004580 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00005055_abs_000005055 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00008508_abs_000008508 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00006366_abs_000006366 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00003104_abs_000003104 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00039389_abs_000039389 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00004734_abs_000004734 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00010458_abs_000010458 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00038397_abs_000038397 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00010830_abs_000010830 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00014335_abs_000014335 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00062811_abs_000062811 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00064992_abs_000064992 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00003401_abs_000003401 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00001428_abs_000001428 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00001567_abs_000001567 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00006134_abs_000006134 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00001431_abs_000001431 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00006566_abs_000006566 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00002110_abs_000002110 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00000598_abs_000000598 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00016153_abs_000016153 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00002277_abs_000002277 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00009286_abs_000009286 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00002017_abs_000002017 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00009150_abs_000009150 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00010065_abs_000010065 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00026720_abs_000026720 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00005399_abs_000005399 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00006571_abs_000006571 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00013066_abs_000013066 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00007463_abs_000007463 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00026374_abs_000026374 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00018138_abs_000018138 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00017976_abs_000017976 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00026375_abs_000026375 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00008602_abs_000008602 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00007959_abs_000007959 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00000038_abs_000000038 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00075472_abs_000075472 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00067462_abs_000067462 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00068141_abs_000068141 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00026412_abs_000026412 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00026787_abs_000026787 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00007321_abs_000007321 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00000951_abs_000000951 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00026291_abs_000026291 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00075561_abs_000075561 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00068628_abs_000068628 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00011048_abs_000011048 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00026981_abs_000026981 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00005665_abs_000005665 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00011132_abs_000011132 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00080010_abs_000080010 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00009675_abs_000009675 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00006989_abs_000006989 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00018162_abs_000018162 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00015764_abs_000015764 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00055283_abs_000055283 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00079098_abs_000079098 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00078529_abs_000078529 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00074768_abs_000074768 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00080008_abs_000080008 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00006222_abs_000006222 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00075551_abs_000075551 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00078415_abs_000078415 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00081177_abs_000081177 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00000034_abs_000000034 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00016100_abs_000016100 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00086095_abs_000086095 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00074555_abs_000074555 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00084010_abs_000084010 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00078793_abs_000078793 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00008150_abs_000008150 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00082714_abs_000082714 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00081220_abs_000081220 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00006704_abs_000006704 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00083027_abs_000083027 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00008205_abs_000008205 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00085205_abs_000085205 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00080638_abs_000080638 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00083796_abs_000083796 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00003423_abs_000003423 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00002587_abs_000002587 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00011161_abs_000011161 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00078141_abs_000078141 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00002278_abs_000002278 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00009658_abs_000009658 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00026433_abs_000026433 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00033631_abs_000033631 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00073135_abs_000073135 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00005974_abs_000005974 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00080524_abs_000080524 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00016012_abs_000016012 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar | |
recs_zero250m_batch_00_data_00000_rel_00083746_abs_000083746 | hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar |
End of preview.
This repository stores converted raw image WebDataset tar shards from multiple source datasets for streaming training.
- Downloads last month
- 5,684