Skip to content

Conversation

@flesher0813
Copy link
Contributor

Purpose

What this PR does / why we need it?

Support users save operation logs to replay trace. When enable using operation db, all load/save operations would be saved in target log_dir.

Modifications

Does this PR introduce any user-facing change?

Support replay traces, and users can modify the yaml file to meet their own needs.

Test

How was this patch tested?

image image

hero0307 and others added 2 commits November 27, 2025 08:46
* [Fix]remove dram docs and modify quick-start doc

* modify index.md

---------

Co-authored-by: t00939662 <[email protected]>
…framework (ModelEngine-Group#295)

Performance testing tool based on the PyTest testing framework.
@flesher0813 flesher0813 force-pushed the adapt_metrics_offline branch from c5852e9 to 0e000f6 Compare November 27, 2025 07:58
class AsyncUCMOperDB:
"""Asynchronous UCM operation log writer (supports log compression + auto-cleanup)"""

def _load_config(self, config_path: str) -> Dict[str, Any]:
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

can we use unified config.py? because root dir also need config.yaml

config["kv_block_size"] / 1024 / 1024,
config["io_size"] / 1024,
)
if role == KVConnectorRole.WORKER and get_tp_group().is_first_rank:
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

we need a param to control whether to enable this function.

mag1c-h and others added 13 commits November 28, 2025 00:14
* [docs]add metrics doc

* modify metrics.md

* modify metrics.md

---------

Co-authored-by: t00939662 <[email protected]>
* [Feat] Implement kv cache broadcast in MLA in ucm_connector

* [Style] Change wait for broadcast into single task method
* add ucm mock connector

* fix chunk prefill bug
* [Feat] Support launch from config file

* [Docs] Update documents for launch with yaml

* [Fix] Change load only on first rank into configuration

* [Feat] Add support for hit ratio in yaml

* [Fix] Fix load only first rank in non mla scene
* adapt GQA & modify config.yaml

* move process to UCMDirectConnector

* fix comment

* modify hash function

* fix style

* code style and modify hash

* init parent_block_hash_value
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

10 participants