Dataset Preview
Duplicate
The full dataset viewer is not available (click to read why). Only showing a preview of the rows.
The dataset generation failed
Error code:   DatasetGenerationError
Exception:    TypeError
Message:      int() argument must be a string, a bytes-like object or a real number, not 'NoneType'
Traceback:    Traceback (most recent call last):
                File "/usr/local/lib/python3.14/site-packages/datasets/builder.py", line 1520, in _prepare_split_single
                  for key, record in generator:
                                     ^^^^^^^^^
                File "/src/services/worker/src/worker/job_runners/config/parquet_and_info.py", line 613, in wrapped
                  for item in generator(*args, **kwargs):
                              ~~~~~~~~~^^^^^^^^^^^^^^^^^
                File "/usr/local/lib/python3.14/site-packages/datasets/packaged_modules/webdataset/webdataset.py", line 130, in _generate_examples
                  for example_idx, example in enumerate(self._get_pipeline_from_tar(tar_path, tar_iterator)):
                                              ~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
                File "/usr/local/lib/python3.14/site-packages/datasets/packaged_modules/webdataset/webdataset.py", line 34, in _get_pipeline_from_tar
                  for filename, f in tar_iterator:
                                     ^^^^^^^^^^^^
                File "/usr/local/lib/python3.14/site-packages/datasets/utils/track.py", line 49, in __iter__
                  for x in self.generator(*self.args):
                           ~~~~~~~~~~~~~~^^^^^^^^^^^^
                File "/usr/local/lib/python3.14/site-packages/datasets/utils/file_utils.py", line 1405, in _iter_from_urlpath
                  with xopen(urlpath, "rb", download_config=download_config, block_size=0) as f:
                       ~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
                File "/usr/local/lib/python3.14/site-packages/datasets/utils/file_utils.py", line 982, in xopen
                  file_obj = fs.open(paths[0], mode)
                File "<string>", line 3, in open
                File "/usr/local/lib/python3.14/unittest/mock.py", line 1176, in __call__
                  return self._mock_call(*args, **kwargs)
                         ~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
                File "/usr/local/lib/python3.14/unittest/mock.py", line 1180, in _mock_call
                  return self._execute_mock_call(*args, **kwargs)
                         ~~~~~~~~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
                File "/usr/local/lib/python3.14/unittest/mock.py", line 1247, in _execute_mock_call
                  result = effect(*args, **kwargs)
                File "/src/services/worker/src/worker/job_runners/config/parquet_and_info.py", line 786, in wrapped
                  tracker.files[urlpath] = {"read": 0, "size": int(f.size)}
                                                               ~~~^^^^^^^^
              TypeError: int() argument must be a string, a bytes-like object or a real number, not 'NoneType'
              
              The above exception was the direct cause of the following exception:
              
              Traceback (most recent call last):
                File "/src/services/worker/src/worker/job_runners/config/parquet_and_info.py", line 1369, in compute_config_parquet_and_info_response
                  parquet_operations, partial, estimated_dataset_info = stream_convert_to_parquet(
                                                                        ~~~~~~~~~~~~~~~~~~~~~~~~~^
                      builder, max_dataset_size_bytes=max_dataset_size_bytes
                      ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
                  )
                  ^
                File "/src/services/worker/src/worker/job_runners/config/parquet_and_info.py", line 948, in stream_convert_to_parquet
                  builder._prepare_split(split_generator=splits_generators[split], file_format="parquet")
                  ~~~~~~~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
                File "/usr/local/lib/python3.14/site-packages/datasets/builder.py", line 1382, in _prepare_split
                  for job_id, done, content in self._prepare_split_single(
                                               ~~~~~~~~~~~~~~~~~~~~~~~~~~^
                      gen_kwargs=gen_kwargs, job_id=job_id, **_prepare_split_args
                      ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
                  ):
                  ^
                File "/usr/local/lib/python3.14/site-packages/datasets/builder.py", line 1560, in _prepare_split_single
                  raise DatasetGenerationError("An error occurred while generating the dataset") from e
              datasets.exceptions.DatasetGenerationError: An error occurred while generating the dataset

Need help to make the dataset viewer work? Make sure to review how to configure the dataset viewer, and open a discussion for direct support.

jpg
image
__key__
string
__url__
string
recs_zero250m_batch_00_data_00000_rel_00003587_abs_000003587
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00000510_abs_000000510
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00000049_abs_000000049
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00000445_abs_000000445
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00005198_abs_000005198
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00004580_abs_000004580
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00005055_abs_000005055
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00008508_abs_000008508
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00006366_abs_000006366
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00003104_abs_000003104
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00039389_abs_000039389
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00004734_abs_000004734
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00010458_abs_000010458
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00038397_abs_000038397
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00010830_abs_000010830
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00014335_abs_000014335
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00062811_abs_000062811
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00064992_abs_000064992
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00003401_abs_000003401
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00001428_abs_000001428
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00001567_abs_000001567
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00006134_abs_000006134
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00001431_abs_000001431
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00006566_abs_000006566
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00002110_abs_000002110
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00000598_abs_000000598
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00016153_abs_000016153
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00002277_abs_000002277
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00009286_abs_000009286
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00002017_abs_000002017
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00009150_abs_000009150
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00010065_abs_000010065
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00026720_abs_000026720
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00005399_abs_000005399
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00006571_abs_000006571
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00013066_abs_000013066
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00007463_abs_000007463
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00026374_abs_000026374
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00018138_abs_000018138
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00017976_abs_000017976
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00026375_abs_000026375
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00008602_abs_000008602
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00007959_abs_000007959
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00000038_abs_000000038
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00075472_abs_000075472
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00067462_abs_000067462
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00068141_abs_000068141
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00026412_abs_000026412
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00026787_abs_000026787
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00007321_abs_000007321
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00000951_abs_000000951
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00026291_abs_000026291
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00075561_abs_000075561
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00068628_abs_000068628
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00011048_abs_000011048
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00026981_abs_000026981
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00005665_abs_000005665
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00011132_abs_000011132
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00080010_abs_000080010
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00009675_abs_000009675
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00006989_abs_000006989
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00018162_abs_000018162
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00015764_abs_000015764
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00055283_abs_000055283
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00079098_abs_000079098
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00078529_abs_000078529
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00074768_abs_000074768
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00080008_abs_000080008
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00006222_abs_000006222
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00075551_abs_000075551
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00078415_abs_000078415
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00081177_abs_000081177
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00000034_abs_000000034
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00016100_abs_000016100
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00086095_abs_000086095
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00074555_abs_000074555
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00084010_abs_000084010
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00078793_abs_000078793
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00008150_abs_000008150
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00082714_abs_000082714
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00081220_abs_000081220
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00006704_abs_000006704
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00083027_abs_000083027
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00008205_abs_000008205
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00085205_abs_000085205
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00080638_abs_000080638
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00083796_abs_000083796
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00003423_abs_000003423
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00002587_abs_000002587
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00011161_abs_000011161
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00078141_abs_000078141
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00002278_abs_000002278
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00009658_abs_000009658
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00026433_abs_000026433
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00033631_abs_000033631
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00073135_abs_000073135
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00005974_abs_000005974
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00080524_abs_000080524
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00016012_abs_000016012
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
recs_zero250m_batch_00_data_00000_rel_00083746_abs_000083746
hf://datasets/MirukuZhang/Pocket-Rocket-1.0-Mid-Training-23M@6dbef8f286a747369503fa074cb336da3f1bfead/ccmb5m/train-00000-of-00209.tar
End of preview.

This repository stores converted raw image WebDataset tar shards from multiple source datasets for streaming training.

Downloads last month
5,684