Feature/scorep feedback #42

OutlyingWest · 2025-04-26T12:43:50Z

Description of Changes

Added and documented a logging system.
Implemented animation for long-running processes in Score-P mode (both for data transfer to the subprocess and for displaying progress animation within the subprocess itself).
Added an example of execution of the long running task - Large array processing with Score-P in examples/ExampleBasic.ipynb
Changed the way subprocess output streams are read — moved this logic into a separate function read_scorep_process_pipe().
Brought tests into a working state:
- Created a new function to clean up garbage from standard output.
- Added a context manager with self.subTest() to better identify which cell failed during execution.
- Temporarily updated expected test outputs to account for a new line introduced for proper animation handling (clear_line string in read_scorep_process_pipe()).

…mprovements 1. Added and documented a logging system. 2. Implemented animation for long-running processes in Score-P mode (both for data transfer to the subprocess and for displaying progress animation within the subprocess itself). 3. Added an example of execution of the long running task – Large array processing with Score-P in `examples/ExampleBasic.ipynb`. 4. Changed the way subprocess output streams are read — moved this logic into a separate function `read_scorep_process_pipe()`. 5. Brought tests into a working state: - Created a new function to clean up garbage from standard output. - Added a context manager `with self.subTest()` to better identify which cell failed during execution. - Temporarily updated expected test outputs to account for a new line introduced for proper animation handling (`clear_line` string in `read_scorep_process_pipe()`).

src/jumper/kernel.py

elwer

For logging, I have the following recommendations:

Check whether the logging levels are appropriate. For example, on line 923 of kernel.py, the message '{os.environ["SCOREP_EXPERIMENT_DIRECTORY"]=}' should not be logged as a warning.
Check the consistency of the logging messages. For example, we have variations of the same message:
'Cell execution failed, cell persistence was not recorded', 'Failed to load cell's persistence to the notebook', 'Failed to pickle notebook's persistence'. These sound relatively similar, although there are differences. Perhaps you could make them more meaningful by adding the context of the persistence step (e.g. Jupyter parent -> Scorep child). You could also add the type of serialisation (memory/disk, serialiser, etc.).
What could help here is an enum to define the potential error messages and, for the future, hints on how to resolve them.

elwer · 2025-05-26T12:41:31Z

src/jumper/kernel.py

+
+    def read_scorep_process_pipe(self, proc: subprocess.Popen[bytes], stdout_lock: threading.Lock) -> list:
+        """
+        Reads and processes the output of a subprocess running with Score-P instrumentation.
+        Args:
+            proc (subprocess.Popen[bytes]): The subprocess whose output is being read.
+            stdout_lock (threading.Lock): Lock to avoid output overlapping
+
+        Returns:
+            list: A list of decoded strings containing "MCM_TS" timestamps.
+        """
+        multicellmode_timestamps = []
+        sel = selectors.DefaultSelector()
+
+        sel.register(proc.stdout, selectors.EVENT_READ)
+        sel.register(proc.stderr, selectors.EVENT_READ)
+
+        line_width = 50
+        clear_line = "\r" + " " * line_width + "\r"
+
+        while True:
+            # Select between stdout and stderr
+            for key, val in sel.select():
+                line = key.fileobj.readline()
+                if not line:
+                    sel.unregister(key.fileobj)
+                    continue
+
+                decoded_line = line.decode(sys.getdefaultencoding(), errors='ignore')
+
+                if key.fileobj is proc.stderr:
+                    with stdout_lock:
+                        self.log.warning(f'{decoded_line.strip()}')
+                elif 'MCM_TS' in decoded_line:
+                    multicellmode_timestamps.append(decoded_line)
+                else:
+                    with stdout_lock:
+                        sys.stdout.write(clear_line)
+                        sys.stdout.flush()
+                        self.cell_output(decoded_line)
+
+            # If both stdout and stderr empty -> out of loop
+            if not sel.get_map():
+                break
+
+        return multicellmode_timestamps
+
+


Please add a bit of documentation here

Documentation extended in 26c9b81

tests/test_kernel.py

…re fixed in branch 'test/improvements'

OutlyingWest · 2025-06-02T13:02:15Z

For logging, I have the following recommendations:

Check whether the logging levels are appropriate. For example, on line 923 of kernel.py, the message '{os.environ["SCOREP_EXPERIMENT_DIRECTORY"]=}' should not be logged as a warning.

Check the consistency of the logging messages. For example, we have variations of the same message:
'Cell execution failed, cell persistence was not recorded', 'Failed to load cell's persistence to the notebook', 'Failed to pickle notebook's persistence'. These sound relatively similar, although there are differences. Perhaps you could make them more meaningful by adding the context of the persistence step (e.g. Jupyter parent -> Scorep child). You could also add the type of serialisation (memory/disk, serialiser, etc.).
What could help here is an enum to define the potential error messages and, for the future, hints on how to resolve them.

Adjusted logging level in 913df54
Unified error message formats in commit 32d2811. Included enum for error types and improved message context as suggested.

OutlyingWest force-pushed the feature/scorep_feedback branch from 03754b2 to 1e56026 Compare May 7, 2025 11:27

OutlyingWest and others added 2 commits May 13, 2025 17:02

wrong spinner status while kernel interruption fixed

161788e

fix unit_test.yml to ubuntu 22.04

0b68076

MandaloreUltimate reviewed May 17, 2025

View reviewed changes

src/jumper/kernel.py Outdated Show resolved Hide resolved

OutlyingWest and others added 4 commits May 18, 2025 19:42

run tests as modules in unit_test.yml

c73b14b

Removed unnecessary pympler package

8774436

test writemode output fixed

54ef7ce

logging imports polishing

a638a64

elwer requested changes May 26, 2025

View reviewed changes

elwer reviewed May 26, 2025

View reviewed changes

tests/test_kernel.py Outdated Show resolved Hide resolved

OutlyingWest added 5 commits May 27, 2025 14:01

DISABLE_PROCESSING_ANIMATIONS env variable description added

1d7c23d

read_scorep_process_pipe documentation extended

26c9b81

A test with ambiguous behavior was removed. Problems of that kind whe…

f95769e

…re fixed in branch 'test/improvements'

kernel error messages system refactored and covered by tests

32d2811

minor logging and messages improvement

913df54

OutlyingWest closed this Jun 2, 2025

OutlyingWest reopened this Jun 2, 2025

OutlyingWest requested a review from elwer June 2, 2025 13:04

Jumper evironment variables names enriched by JUMPER_* prefix

b19a0c3

elwer approved these changes Jun 5, 2025

View reviewed changes

elwer merged commit d350f21 into master Jun 5, 2025
3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feature/scorep feedback #42

Feature/scorep feedback #42

Uh oh!

OutlyingWest commented Apr 26, 2025

Uh oh!

Uh oh!

elwer left a comment

Uh oh!

elwer May 26, 2025

Uh oh!

OutlyingWest Jun 2, 2025 •

edited

Loading

Uh oh!

Uh oh!

OutlyingWest commented Jun 2, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Feature/scorep feedback #42

Feature/scorep feedback #42

Uh oh!

Conversation

OutlyingWest commented Apr 26, 2025

Uh oh!

Uh oh!

elwer left a comment

Choose a reason for hiding this comment

Uh oh!

elwer May 26, 2025

Choose a reason for hiding this comment

Uh oh!

OutlyingWest Jun 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

OutlyingWest commented Jun 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

OutlyingWest Jun 2, 2025 •

edited

Loading

OutlyingWest commented Jun 2, 2025 •

edited

Loading