-
Notifications
You must be signed in to change notification settings - Fork 47
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Running RMS tests failed. #70
Comments
Maybe it is the index file. Can you check whether the chromosome information is correctly stored in |
I don't have an index file, and my data is not BS-seq data. So I did not perform build-reference, Processing single-end data, and Processing paired-end data processes. I just changed my methylation result data to a similar allc file format, so the seventh column of my "allc file" was simply set to 1. |
Yes, DMRfind only needs allc files. Do you mind to share the two files for me to reproduce the error? |
Ok.If this step can be achieved,it will be very helpful for me .Thank you very much. What is your email address? My file is a little big. |
Can you reproduce the error with the first say 30 lines of allc files? |
Ok.My methylation data is: Splitting allc files for chromosome 1 <class 'KeyError'> 179 |
I am also facing the same problem, please have you been able to find the error. |
It is totally fine to set the last column to be 1. The current issue is that the context column (4th) format in the input file is not supported by methylpy. Reformatting the sequence context as the the last three bases should fix this problem. For example, |
Hi is there a way to set --chroms 1 2 parameter to accept more than one string. For example my data is formated as NC_037328.1 but the map function splits it into ["N", "C", " _", "0", "3", "7", "3", "2", "8", ".", "1" ]. That is the cause of my error. Is there a way to set it, my data is very large and I am reluctant to reformat it? |
Methylpy should be able to handle the chromosome names with more than one characters like chr1. Can you post the command you ran? |
methylpy DMRfind |
What version of methylpy are you using? I am not able to reproduce your error. Below are what I tried. Input files are attached. Are you able to run the below command without error?
Input files: |
I am using methylpy 1.4.3 version. The example you gave me works for me also but my input is not working. This is the exact error that I get: <class 'KeyError'> 184 |
Do you mind to share the first 20 lines of your allc files? |
NC_037328.1,28599,+,CAG,0,1,1 |
Ah, the fields need to be tab separated. Can we try fixing the format and running DMRfind? |
Its tab separated; the split I replace the split with coma. Howvere doing a print gives this: NC_037328.1 28599 + CAG 0 1 1 NC_037328.1 34167 + CTG 0 2 1 NC_037328.1 47181 - CAT 0 1 1 NC_037328.1 134883 - CAT 0 1 1 NC_037328.1 138299 - CAT 0 2 1 NC_037328.1 138300 + CCT 0 2 1 NC_037328.1 138301 + CTG 0 2 1 NC_037328.1 138303 - CAG 0 2 1 NC_037328.1 138306 - CAT 0 2 1 NC_037328.1 138310 + CAC 0 2 1 NC_037328.1 138312 + CAG 0 2 1 NC_037328.1 138314 - CTG 0 2 1 NC_037328.1 138317 + CAA 0 2 1 NC_037328.1 138320 - CTT 0 2 1 NC_037328.1 138322 - CAC 0 2 1 NC_037328.1 140407 - CTA 0 4 1 NC_037328.1 140408 - CCT 0 4 1 NC_037328.1 140409 + CAA 0 4 1 NC_037328.1 145179 - CAG 0 1 1 NC_037328.1 145180 - CCA 0 1 1 NC_037328.1 145868 - CAA 0 3 1 NC_037328.1 146655 + CAA 1 5 1 NC_037328.1 149309 - CAG 0 1 1 could the extra space be the problem? |
It seems that there are no CGN sites in your allc files. Is that correct? If so, that could be the cause of the problem. Do you also get the same error by running this?
|
Hello, I recently wanted to use methylpy to calculate DMR, I processed my methylation data into the allc file format,:
But I got an error like this:
Splitting allc files for chromosome 1
Wed Jun 16 20:12:56 2021
<class 'KeyError'> 179
'1'
Running RMS tests failed.
I don't know what the problem is, I hope to advise. thank you.
The text was updated successfully, but these errors were encountered: