Update check_modular_conversion #37456

qubvel · 2025-04-11T14:14:32Z

What does this PR do?

Processes modular files with multiprocessing (with and without the fix_and_overwrite flag).
While checking, we should always overwrite files to be sure we did not miss any conversion.

HuggingFaceDocBuilderDev · 2025-04-11T14:40:41Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

qubvel · 2025-04-14T09:56:34Z

tests/repo_utils/modular/test_conversion_order.py

@@ -23,7 +23,7 @@
    os.path.join(MODEL_ROOT, "rt_detr", "modular_rt_detr.py"),
    os.path.join(MODEL_ROOT, "qwen2", "modular_qwen2.py"),
    os.path.join(MODEL_ROOT, "qwen3", "modular_qwen3.py"),
-    os.path.join(MODEL_ROOT, "qwen3", "modular_qwen3_moe.py"),
+    os.path.join(MODEL_ROOT, "qwen3_moe", "modular_qwen3_moe.py"),


It looks like this test either is not running on CI or has been failing.

ydshieh · 2025-04-15T08:32:56Z

@qubvel Thank you for working on this.

I have to admit that I need to look into the sorting and dependency logic here to get a better understanding before I can judge correctly.

However, I have a question: since we are running with multiple processes, and each one modifies some modeling files (i.e. generates them from modular files), it's not very clear to me that we are free of race conditions.

Maybe this can't happen given the way we (and you) handle the dependencies and the control flow, but I find it kind of difficult to see the logic clearly.

I will leave some comments and questions in the PR changes so we can discuss in more specific positions.
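
To make the race-condition question concrete, here is a minimal sketch of processing files level by level with a barrier between levels. It is not the code from this PR: convert_one, run_by_level, and executor_factory are hypothetical names, and the sketch assumes that two files in the same level never write to the same generated file, which this thread does not confirm.

```python
from concurrent.futures import Executor, ProcessPoolExecutor, ThreadPoolExecutor
from typing import Callable


def convert_one(modular_path: str) -> str:
    """Hypothetical stand-in for converting one modular file; returns the path it handled."""
    return modular_path


def run_by_level(
    levels: list[list[str]],
    worker: Callable[[str], str] = convert_one,
    executor_factory: Callable[[], Executor] = ProcessPoolExecutor,
) -> list[list[str]]:
    """Process each level in parallel, finishing a level before starting the next."""
    done: list[list[str]] = []
    with executor_factory() as pool:
        for level_files in levels:
            # list(...) drains the result iterator, so this line acts as a barrier:
            # no file of the next level starts until every file here has finished.
            done.append(list(pool.map(worker, level_files)))
    return done


if __name__ == "__main__":
    # Hypothetical levels: entries within one level are assumed to be independent.
    example_levels = [["modular_a.py", "modular_d.py"], ["modular_b.py"]]
    print(run_by_level(example_levels))
    # A thread pool can be swapped in when debugging without separate processes.
    print(run_by_level(example_levels, executor_factory=ThreadPoolExecutor))
```

With this shape, the only concurrency is inside a single level, so the question reduces to whether files in the same level can touch the same output.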

ydshieh · 2025-04-15T08:38:03Z

utils/create_dependency_mapping.py

        # Remove the leafs from the graph (and from the deps of other nodes)
        graph = {node: deps - leaf_nodes for node, deps in graph.items() if node not in leaf_nodes}

-    return [name_mapping[x] for x in sorting_list]
+    return sorting_list


It seems to me that the return type has changed from a list of strings (modular file paths) to list[list[str]].
It would be nice to explain this (i.e. the algorithm) and to add a docstring about it (which is also missing on main).

The caller of topological_sort (find_priority_list) still has ordered_files as a variable, and its docstring still says A tuple with the ordered files (list). If I'm not mistaken, this is no longer the case. Should that be updated too?
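
For readers following along, a level-based topological sort that returns list[list[str]] could look roughly like the sketch below. It mirrors the leaf-removal step visible in the diff but is not the PR's implementation: topological_levels, the cycle check, and the example file names and edges are assumptions made up for illustration.

```python
def topological_levels(graph: dict[str, set[str]]) -> list[list[str]]:
    """Group nodes into dependency levels.

    `graph` maps each node to the set of nodes it depends on.
    Level 0 holds nodes without dependencies; level k holds nodes whose
    dependencies all sit in earlier levels.
    """
    # Work on a copy and ignore dependencies that are not nodes of the graph.
    remaining = {node: {d for d in deps if d in graph} for node, deps in graph.items()}
    levels: list[list[str]] = []
    while remaining:
        # Leaves are nodes whose dependencies have all been emitted already.
        leaf_nodes = {node for node, deps in remaining.items() if not deps}
        if not leaf_nodes:
            raise ValueError("dependency cycle detected")
        levels.append(sorted(leaf_nodes))
        # Remove the leaves from the graph (and from the deps of other nodes).
        remaining = {node: deps - leaf_nodes for node, deps in remaining.items() if node not in leaf_nodes}
    return levels


# Hypothetical dependency graph; the edges are invented for this example.
example = {
    "modular_a.py": set(),
    "modular_b.py": {"modular_a.py"},
    "modular_c.py": {"modular_b.py"},
    "modular_d.py": set(),
}
levels = topological_levels(example)
```

Each inner list holds files that do not depend on one another, so a caller iterating over the result sees one dependency level per iteration.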

ydshieh · 2025-04-15T08:44:57Z

utils/check_modular_conversion.py

+    console.print(f"[bold yellow]Files per level: {tuple([len(x) for x in ordered_files])}[/bold yellow]")
+
+    try:
+        for dependency_level_files in ordered_files:


At this point, it's not easy to understand what dependency_level_files and ordered_files are.

ydshieh · 2025-04-15T08:53:17Z

utils/check_modular_conversion.py

+                if not args.check_all and guaranteed_no_diff(file_path, dependencies, models_in_diff):
+                    skipped_models.add(file_path.split("/")[-2])  # save model folder name
+                else:
+                    files_to_check.append(file_path)


So files_to_check is a list of modular file paths, and there won't be any duplicated elements?
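
One way to check the no-duplicates assumption directly is sketched below. The helpers find_duplicates and dedupe_keep_order are hypothetical and not part of the PR; the sample paths are made up.

```python
def find_duplicates(paths: list[str]) -> list[str]:
    """Return each path that appears more than once, in order of first repeat."""
    seen: set[str] = set()
    duplicates: list[str] = []
    for path in paths:
        if path in seen and path not in duplicates:
            duplicates.append(path)
        seen.add(path)
    return duplicates


def dedupe_keep_order(paths: list[str]) -> list[str]:
    """Drop repeated paths while keeping the first occurrence of each."""
    # dict preserves insertion order, so the original ordering survives.
    return list(dict.fromkeys(paths))


# Hypothetical input with one repeated entry.
sample = ["x/modular_x.py", "y/modular_y.py", "x/modular_x.py"]
duplicates = find_duplicates(sample)
unique = dedupe_keep_order(sample)
```

An assertion that find_duplicates returns an empty list before dispatching work would surface the problem early if the assumption ever breaks.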

qubvel · 2025-04-15T11:07:59Z

Thanks for the review, @ydshieh. That's definitely a fair point. I will add more comments to clarify the algorithm!

qubvel added 6 commits April 11, 2025 13:32

Change topological sort to return level-based output (lists of lists)

7e6866e

Update main for modular converter

e41c2e1

Update test

8d0d83f

update check_modular_conversion

d02cf0b

Update gitignore

888ba66

Fix missing conversion for glm4

af7cd0c

qubvel added 3 commits April 11, 2025 15:40

Update

141ee5d

Fix error msg

405360c

Fixup

09abf85

qubvel requested a review from ydshieh April 14, 2025 09:54

qubvel marked this pull request as ready for review April 14, 2025 09:54

Merge branch 'main' into update-check-modular-conversion

86401d4
