Skip to content

为什么在step>0的时候将do_sample设置为False呢? #62

@XingYing-stack

Description

@XingYing-stack

感谢作者杰出的工作,我注意到在tool_utils.py中,作者设置了next_data.meta_info['do_sample'] = False # step > 0 does not do sample

请问这是出于什么样子的考虑呢?我这边发现在reasoning模型下,这样可能导致重复?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions