-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Demultiplexed FASTQ support #23
Conversation
This is a minor bugfix
For demultiplexed FASTQs, there is no column barcode policy
8738b9f
to
46cc0ae
Compare
f404cdf
to
ddc22d9
Compare
ddc22d9
to
4c16be1
Compare
src/main/scala/org/broadinstitute/gpp/poolq3/barcode/DmuxedBarcodeSource.scala
Outdated
Show resolved
Hide resolved
src/main/scala/org/broadinstitute/gpp/poolq3/barcode/DmuxedPairedEndBarcodeSource.scala
Outdated
Show resolved
Hide resolved
@@ -181,3 +181,19 @@ test-multiple-inputs: | |||
diff lognormalized-counts.txt ../../test-data/lognormalized-counts.txt && \ | |||
diff barcode-counts.txt ../../test-data/barcode-counts.txt && \ | |||
diff correlation.txt ../../test-data/correlation.txt) | |||
|
|||
test-demultiplexed: wd = $(test-output-dir)/demultiplexed | |||
test-demultiplexed: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The length limit for a unix command line is 4096 chars, which is starting to look achievable with this new demuxed input mode. I guess that is all the more reason to switch to a JSON config file--was that just for pq-launcher or for pq as well?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I wasn't planning to implement it in PoolQ in this PR. I think that is how PoolQ4 will work. In our case, the poolq launcher constructs the config object directly from its command line, but I'm planning to change the poolq launcher command line to take only 2 parameters: --db
and --job-id
, which it will use to read everything it needs from the database, obviating the need for command lines that might exceed the limit.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks great. I have one performance-related comment/question, and then 🧼 🧇 🦕
PQ4 might be a good time to make the switch from |
I don't know about |
Another PQ4 thought, I'm currently getting FASTQ records as |
No description provided.