-
Notifications
You must be signed in to change notification settings - Fork 359
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Failed to evaluate job outputs - IOException: Could not read from s3... #4687
Comments
Hey, did you ever manage to get a workaround for this error? |
@caaespin I'm assuming that means you still see this. Are you using a recent Cromwell version? (42+) |
@geoffjentry yes. My current deployment is v42. If you have access to the GATK forums, i put more details in my post there: https://gatkforums.broadinstitute.org/wdl/discussion/24268/aws-batch-randomly-fails-when-running-multiple-workflows/p1?new=1 |
One up. I have similar error |
@geoffjentry from inspecting logs and AWS Batch console, i think what is happening is that the jobs fail because Cromwell shutdowns the VMs earlier than expected. So one of shard hasn't finished and is unable to upload to S3, hence the problem here occurs. Anyways this is a hypothesis based on what I saw, hopefully is helpful. |
@geoffjentry Any movement on this? I'm having this same issue sporadically (v48 + AWS backend) with workflows that contain large scatter operations. |
@alexwaldrop NB that I don't work there anymore and sadly haven't had the energy to actively contribute. Perhaps @aednichols can chime in |
I am having the same error with the example "Using Data on S3" on https://docs.opendata.aws/genomics-workflows/orchestration/cromwell/cromwell-examples/ . I have changed the S3 bucket name in the .json file to my bucket name, but the run still failed. After reporting running failure, I have got the same error message. I am using cromwell-48. The S3 bucket has all public access, and I was logged in as the Admin in two terminal windows, one running the server and the other submitting the job. The previous two hello-world example were successful. There is no log file in the bucket and in the cromwell-execution, the only file create was the script. There is no rc or stderr or stdout created. |
@blindmouse Were you able to resolve your issue? I am encountering the same problem. Thanks. |
This can happen if the job fails meaning that an rc.txt file isn’t created.
It would be worth looking at the CloudWatch log for the batch job.
…On Tue, Jul 21, 2020 at 4:07 PM Sri Paladugu ***@***.***> wrote:
Is there any progress on this issue? I am the getting the following
exception: IOException: Could not read from
s3:///results/ReadFile/5fec5c4a-2e3f-49ed-8f9e-6d9d2d759449/call-read_file/read_file-rc.txt
Caused by: java.nio.file.NoSuchFileException: s3://
s3.amazonaws.com/s3bucketname/results/ReadFile/5fec5c4a-2e3f-49ed-8f9e-6d9d2d759449/call-read_file/read_file-rc.txt
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
<#4687 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AF2E6EMJZ66Z5PIAEUX3IBLR4XYPZANCNFSM4G23FFUQ>
.
|
Cloudwatch logs contained the following message: "/bin/bash: /var/scratch/fetch_and_run.sh: Is a directory" |
It may be that you’re running Cromwell 52 or later with an older AWS
CloudFormation built infrastructure. Can you share which build of Cromwell
you’re using and the build/ version/ origin of the CloudFormation template?
On Tue, Jul 21, 2020 at 8:18 PM Sri Paladugu <[email protected]>
wrote:
… This can happen if the job fails meaning that an rc.txt file isn’t
created. It would be worth looking at the CloudWatch log for the batch job.
… <#m_-7712250081708699723_>
On Tue, Jul 21, 2020 at 4:07 PM Sri Paladugu *@*.***> wrote: Is there any
progress on this issue? I am the getting the following exception:
IOException: Could not read from
s3:///results/ReadFile/5fec5c4a-2e3f-49ed-8f9e-6d9d2d759449/call-read_file/read_file-rc.txt
Caused by: java.nio.file.NoSuchFileException: s3://
s3.amazonaws.com/s3bucketname/results/ReadFile/5fec5c4a-2e3f-49ed-8f9e-6d9d2d759449/call-read_file/read_file-rc.txt
— You are receiving this because you are subscribed to this thread. Reply
to this email directly, view it on GitHub <#4687 (comment)
<#4687 (comment)>>,
or unsubscribe
https://github.com/notifications/unsubscribe-auth/AF2E6EMJZ66Z5PIAEUX3IBLR4XYPZANCNFSM4G23FFUQ
.
Cloudwatch logs contained the following message: "/bin/bash:
/var/scratch/fetch_and_run.sh: Is a directory"
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#4687 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AF2E6ENOHHXQP6VC5XUGZ5TR4YV5XANCNFSM4G23FFUQ>
.
|
Hi @markjschreiber I'm also running into this error. I am using cromwell 53 with a custom cdk stack based on the CloudFormation infrastructure described here: https://docs.opendata.aws/genomics-workflows/ Are modifications needed for compatibility with newer versions of Cromwell? Are these documented somewhere? |
Attached is some documentation that works for v52 and should work for v53
…On Wed, Sep 9, 2020 at 9:20 AM mderan-da ***@***.***> wrote:
Hi @markjschreiber <https://github.com/markjschreiber> I'm also running
into this error. I am using cromwell 53 with a custom cdk stack based on
the CloudFormation infrastructure described here:
https://docs.opendata.aws/genomics-workflows/
Are modifications needed for compatibility with newer versions of
Cromwell? Are these documented somewhere?
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#4687 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AF2E6EO6WEE4BYYPTX4HZ2LSE56JXANCNFSM4G23FFUQ>
.
|
Hi @markjschreiber Thanks but it looks like the attachment didn't come through. |
@markjschreiber running into the same error for both v52 and v53.1. I am using the same CloudFormation @mderan-da mentioned . Appreciate your newer documentation on this. |
Documentation can be downloaded from here
https://cromwell-share-ad485.s3.us-east-2.amazonaws.com/InstallingGenomicsWorkflowCoreWithCromwel52.pdf
…On Sun, Sep 13, 2020 at 4:48 PM Yaomin Xu ***@***.***> wrote:
@markjschreiber <https://github.com/markjschreiber> running into the same
error for both v52 and v53.1. I am using the same CloudFormation
@mderan-da <https://github.com/mderan-da> mentioned . Appreciate your
newer documentation on this.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#4687 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AF2E6EKCM56WST3J6NO5CS3SFUVYLANCNFSM4G23FFUQ>
.
|
Also have this error. Anyone figure out what the issue is? |
Also have this error, using Cromwell 52, installed using this manual : logs say : fetch_and_run.is is a directory. |
Extra info : cloning job & resubmitting through aws console runs fine. so it seems to be a temporary issue |
Hmmm, still stuck on this - any updates from your guys' end? I tried cloning and resubmitting, still getting the same error. |
Still getting this error today. |
I'm getting this error almost certainly when I run workflows where more samples (e.g. 96) than usual are scattered. Is there a workaround to this? |
Hi all, are there any updates to a workaround for this error? I'm getting the same error using Cromwell v87 |
While testing cromwell-36 with AWS batch I was able to reproduce this error:
The error occurs when running many sub-workflows within a single wrapping workflow.
The environment is configured correctly, and the test usually passes when running <30 subworkflows.
Here are the workflows:
run_multiple_test.wdl
three_task_sequence.wdl
Here is the cromwell-conf:
Would appreciate help on this.
I wonder if cromwell was ever tested for many parallel sub-workflows running on AWS?
Thanks!
The text was updated successfully, but these errors were encountered: