Description
https://github.com/rllm-org/rllm/blob/main/rllm/experimental/unified_trainer.py#L216
Here we do `norm_adv_by_std_in_grpo=self.rllm_config.stepwise_advantage.get("norm_adv_by_std_in_grpo", True)`, but if I'm not mistaken, the `norm_adv_by_std_in_grpo` flag is defined at `rllm.algorithm.norm_adv_by_std_in_grpo` in the config files, so this lookup would always fall back to the default of `True` even when the user sets the flag explicitly.
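A plain-dict sketch of the mismatch (the key names come from the code linked above; the dict values here are purely illustrative, not taken from any real config):

```python
# Illustrative config shaped like the rLLM YAML: the flag lives under
# algorithm, not under stepwise_advantage.
config = {
    "algorithm": {"norm_adv_by_std_in_grpo": False},
    "stepwise_advantage": {"enable": True},
}

# Lookup as written in unified_trainer.py: the key is absent under
# stepwise_advantage, so it silently falls back to the default (True),
# ignoring the user's explicit False.
current = config["stepwise_advantage"].get("norm_adv_by_std_in_grpo", True)

# Suspected intended lookup, reading from the algorithm section:
intended = config["algorithm"].get("norm_adv_by_std_in_grpo", True)

print(current, intended)  # True False
```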
Steps to Reproduce
n/a
Error Output / Traceback
rLLM Version
latest main
Training Backend
tinker
Python Version
3.11.14
GPU / CUDA Version
No response
vLLM Version (if applicable)
No response
Training Script / Config
Additional Context
No response