Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bug report: ValueError: invalid literal for int() with base 10: '/tmp/mds-cache/mds-coco-2014-val-fid-clip-17' #87

Open
viyjy opened this issue Oct 19, 2023 · 0 comments

Comments

@viyjy
Copy link

viyjy commented Oct 19, 2023

Hi, after installing everything by following these commands

git clone https://github.com/mosaicml/diffusion.git
cd diffusion
pip install -e .

I was trying to run fid-clip-evaluation.py but got the following error:

Traceback (most recent call last):
  File "diffusion/scripts/fid-clip-evaluation.py", line 39, in <module>
    coco_val_dataloader = build_streaming_cocoval_dataloader(
  File "diffusion/diffusion/datasets/coco/coco_captions.py", line 110, in build_streaming_cocoval_dataloader
    dataset = StreamingCOCOCaption(
  File "diffusion/diffusion/datasets/coco/coco_captions.py", line 60, in __init__
    super().__init__(
  File "python3.9/site-packages/streaming/base/dataset.py", line 496, in __init__
    self._shm_prefix_int, self._locals_shm = get_shm_prefix(streams_local, streams_remote,
  File "python3.9/site-packages/streaming/base/shared/prefix.py", line 189, in get_shm_prefix
    prefix_int = _check_and_find_retrying(streams_local, streams_remote, retry)
  File "python3.9/site-packages/streaming/base/shared/prefix.py", line 162, in _check_and_find_retrying
    raise errs[-1]
  File "python3.9/site-packages/streaming/base/shared/prefix.py", line 158, in _check_and_find_retrying
    return _check_and_find(streams_local, streams_remote)
  File "python3.9/site-packages/streaming/base/shared/prefix.py", line 115, in _check_and_find
    their_locals, _ = _unpack_locals(bytes(shm.buf))
  File "python3.9/site-packages/streaming/base/shared/prefix.py", line 75, in _unpack_locals
    return text[:-1], int(text[-1] or 0)
ValueError: invalid literal for int() with base 10: '/tmp/mds-cache/mds-coco-2014-val-fid-clip-17'
ERROR:composer.cli.launcher:Rank 0 crashed with exit code 1.
Waiting up to 30 seconds for all training processes to terminate. Press Ctrl-C to exit immediately.

This error is fixed after installing old packages:

pip install mosaicml==0.14.1
pip install mosaicml-streaming==0.5.0
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant