Conversation
…n/fix/task_description_bug Fix Task description Specs.
…n/feature/deployment_improve Feature/deployment improve
Important: Review skipped. Auto reviews are disabled on base/target branches other than the default branch. Please check the settings in the CodeRabbit UI.

Note: Other AI code review bot(s) detected. CodeRabbit has detected other AI code review bot(s) in this pull request and will avoid duplicating their findings in the review comments. This may lead to a less comprehensive review.
Tip: 👮 Agentic pre-merge checks are now available in preview! Pro plan users can now enable pre-merge checks in their settings to enforce checklists before merging PRs. Please see the documentation for more information. Example:

```yaml
reviews:
  pre_merge_checks:
    custom_checks:
      - name: "Undocumented Breaking Changes"
        mode: "warning"
        instructions: |
          Pass/fail criteria: All breaking changes to public APIs, CLI flags,
          environment variables, configuration keys, database schemas, or
          HTTP/GraphQL endpoints must be documented in the "Breaking Change"
          section of the PR description and in CHANGELOG.md. Exclude purely
          internal or private changes (e.g., code not exported from package
          entry points or explicitly marked as internal).
```
Summary of Changes

Hello @javidsegura, I'm Gemini Code Assist! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request significantly refines the AlphaFold protein binding pipeline by introducing and formalizing a 'Separate Pipelines Design' for handling multiple structures. It includes comprehensive documentation of the new and existing pipeline architectures, alongside core logic changes that enable child pipelines to operate more efficiently by skipping redundant initial steps. The update also incorporates minor fixes for path handling and enhances logging for better traceability of adaptive decisions within the pipeline.
Code Review
This pull request introduces an adaptive optimization strategy by allowing pipelines to spawn child pipelines for proteins with degrading quality scores. The changes are generally well-structured, including a new README to explain the design. However, I've identified a high-severity logic bug where proteins could be inadvertently dropped from processing. I've also noted several medium-severity issues related to Python best practices, such as the use of mutable default arguments and unnecessary code, along with a minor typo in a log message. Addressing these points will improve the robustness and maintainability of the new feature.
```python
else:
    pipeline.previous_scores = copy.deepcopy(pipeline.current_scores)
```
There is a potential data loss bug here. If sub_iter_seqs is not empty but a child pipeline cannot be created (e.g., pipeline.sub_order >= MAX_SUB_PIPELINES), this else block is executed. The proteins in sub_iter_seqs have already been removed from pipeline.iter_seqs on line 76. Since this block only updates previous_scores, those proteins are effectively dropped from any further processing. They should be added back to the current pipeline's iter_seqs if they are not moved to a child.
Consider this implementation:

```python
else:
    if sub_iter_seqs:
        # If a child pipeline could not be created, add the sequences back to the parent.
        pipeline.iter_seqs.update(sub_iter_seqs)
    pipeline.previous_scores = copy.deepcopy(pipeline.current_scores)
```
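The drop-and-restore behavior described above can be demonstrated in isolation. This is a minimal, self-contained sketch: `Pipeline` and `handle_no_child` are hypothetical stand-ins for the project's actual classes and control flow, which are not shown in this review.

```python
import copy

# Hypothetical stand-in for the pipeline object discussed in the comment.
class Pipeline:
    def __init__(self, iter_seqs, current_scores):
        self.iter_seqs = dict(iter_seqs)
        self.current_scores = dict(current_scores)
        self.previous_scores = {}

def handle_no_child(pipeline, sub_iter_seqs):
    # The corrected else-branch: sequences that could not be moved to a
    # child pipeline are returned to the parent instead of being dropped.
    if sub_iter_seqs:
        pipeline.iter_seqs.update(sub_iter_seqs)
    pipeline.previous_scores = copy.deepcopy(pipeline.current_scores)

# "prot_b" was already split out of iter_seqs before the child-creation
# attempt failed; without the update() call it would simply vanish.
p = Pipeline({"prot_a": "SEQA"}, {"prot_a": 0.9, "prot_b": 0.4})
handle_no_child(p, {"prot_b": "SEQB"})
print(sorted(p.iter_seqs))  # ['prot_a', 'prot_b']
```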
```python
decision = await adaptive_criteria(curr_score, pipeline.previous_scores[protein])
...
pipeline.logger.pipeline_log(f'Adaptive descision: {decision}')
```

There is a minor typo in the log message: 'descision' should be 'decision'.
```diff
-async def s1(task_description=None):
-    if task_description is None:
-        task_description = {"ranks": 1}
+async def s1(task_description={"gpus_per_rank": 1}):  # noqa: B006
```
Using a mutable default argument like a dictionary is generally discouraged as it can lead to unexpected behavior if the object is modified. While it is not modified here, the previous pattern of using None as a default and creating the dictionary within the function is safer and improves maintainability. The noqa: B006 indicates awareness, but adhering to best practices would be better in the long run.
A safer implementation would be:

```python
async def s1(task_description=None):
    if task_description is None:
        task_description = {"gpus_per_rank": 1}
    # ... rest of function
```

```diff
-async def s4(target_fasta, task_description=None):
-    if task_description is None:
-        task_description = {"gpus_per_rank": 1}
+async def s4(target_fasta, task_description={"gpus_per_rank": 1}):  # noqa: B006
```
Similar to the comment on s1, using a mutable default argument here is risky for future maintenance. It's safer to initialize the default to None and create the dictionary inside the function body to avoid potential side effects if the function logic changes.
A safer implementation would be:

```python
async def s4(target_fasta, task_description=None):
    if task_description is None:
        task_description = {"gpus_per_rank": 1}
    # ... rest of function
```

```diff
 @self.auto_register_task()  # pLDTT_extract
-async def s5(task_description=None):
-    if task_description is None:
-        task_description = {}
+async def s5(task_description={}):  # noqa: B006
```
As with s1 and s4, using a mutable default argument is not recommended. To improve code safety and maintainability, please consider using None as the default and creating the dictionary inside the function.
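For contrast, the None-sentinel pattern guarantees a fresh dict per call. `s5_like` below is a hypothetical stand-in for the `s5` task, used only to illustrate the behavior.

```python
# Recommended pattern: with a None sentinel, each call builds its own
# dict, so mutations can never leak between invocations.
def s5_like(task_description=None):
    if task_description is None:
        task_description = {}
    task_description["seen"] = True
    return task_description

print(s5_like() is s5_like())  # False -- a fresh dict per call
```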
A safer implementation would be:

```python
async def s5(task_description=None):
    if task_description is None:
        task_description = {}
    # ... rest of function
```