Add tests #26

ethan-tonic · 2025-03-19T16:30:47Z

No description provided.

…d of 'textual_sdk_tests'

gandersteele

Looks good, but need to have coverage of synthesis mode. See above comments. Once env vars are added to secrets we'll merge

gandersteele · 2025-03-22T17:16:35Z

tests/sample.env

@@ -0,0 +1,12 @@
+/* retrieve from 1pass. Note is called pytest env file */


do we need this in the sdk repo? only relevant for backend

i see, this is for the pipeline tests

gandersteele · 2025-03-22T17:28:05Z

tests/utils/dataset_utils.py

+def check_dataset_str(original_text: str, dataset_str: str):
+    # Extract all redacted portions using regex pattern for [ENTITY_TYPE_*]
+    redaction_pattern = r"\[([A-Z_]+)(?:_[a-zA-Z0-9]+)?\]"
+    redactions = re.findall(redaction_pattern, dataset_str)
+
+    # Replace all redactions with empty string to get the non-redacted text
+    non_redacted_text = re.sub(redaction_pattern, "", dataset_str)
+
+    # Check if the non-redacted portions exist in the original text
+    for segment in non_redacted_text.split():
+        if segment.strip():  # Skip empty segments
+            assert segment in original_text, (
+                f"Non-redacted segment '{segment}' not found in original text"
+            )
+
+    # Ensure we found at least one redaction
+    assert len(redactions) > 0, "No redactions found in the dataset string"


this is good, but note that it doesnt apply in synthesis mode. i'd suggest a similar method that
1.asserts len(spans) > 0
2. asserts that original_text[span['start']:span['end']] == span['text']
3. asserts that dataset_str[span['new_start']:span['new_end']] == span['new_text']
this is a slightly different test than yours, so can be done in addition to, but the main point is that this exercises the synthesis mode as well. we could add additioanl checks that in synthesis mode, replacement text doesnt contain the standard redaction pattern

ethan-tonic added 5 commits March 19, 2025 11:59

Initial add

bef4bcd

Remove reqs

0b831bc

Fix up workflows

8b3c777

Add environment variables for test execution in GitHub Actions

98b66f3

Update resource path validation to check for 'tests' directory instea…

01ebe5b

…d of 'textual_sdk_tests'

gandersteele requested changes Mar 22, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add tests #26

Add tests #26

ethan-tonic commented Mar 19, 2025

gandersteele left a comment

gandersteele Mar 22, 2025

gandersteele Mar 22, 2025

gandersteele Mar 22, 2025

		@@ -0,0 +1,12 @@
		/* retrieve from 1pass. Note is called pytest env file */

Add tests #26

Are you sure you want to change the base?

Add tests #26

Conversation

ethan-tonic commented Mar 19, 2025

gandersteele left a comment

Choose a reason for hiding this comment

gandersteele Mar 22, 2025

Choose a reason for hiding this comment

gandersteele Mar 22, 2025

Choose a reason for hiding this comment

gandersteele Mar 22, 2025

Choose a reason for hiding this comment