chore: Increase backend test code coverage to 80% #431

AlexMadera · 2025-12-30T16:01:01Z

Base Branch

This PR targets the develop branch (required for all feature/fix PRs)
This PR targets main (hotfix only - maintainers)

Description

Related Issue

Closes #

Type of Change

Area

Frontend
Backend
Fullstack

Commit Message Format

Follow conventional commits: <type>: <subject>

Types: feat, fix, docs, style, refactor, test, chore

Example: feat: add user authentication system

Checklist

I've synced with develop branch
I've tested my changes locally
I've followed the code principles (SOLID, DRY, KISS)
My PR is small and focused (< 400 lines ideally)

CI/Testing Requirements

All CI checks pass
All existing tests pass
New features include test coverage
Bug fixes include regression tests

Screenshots

Before	After

Feature Toggle

Behind localStorage flag: use_feature_name
Behind settings toggle
Behind environment variable/config
N/A - Feature is complete and ready for all users

Breaking Changes

Breaking: Yes / No

Details:

Summary by CodeRabbit

Tests
- Added comprehensive test coverage for security validation, security parsing, service detection, spec pipeline management, task logging utilities, and UI components to enhance code reliability.
Chores
- Strengthened CI/CD pipeline with increased code coverage threshold.
- Configured backend coverage settings for consistent testing standards.

_{✏️ Tip: You can customize this high-level summary in your review settings.}

coderabbitai · 2025-12-30T16:01:07Z

📝 Walkthrough

Walkthrough

This PR enhances test coverage by raising the minimum coverage threshold from 20% to 75%, adds a comprehensive .coveragerc configuration file defining coverage exclusions, adds a pragma: no cover annotation to the CLI entry point, and introduces extensive test modules across the backend codebase validating git validators, security hooks, parsing logic, service detection, spec pipeline orchestration, task logging, and UI utilities.

Changes

Cohort / File(s)	Summary
CI & Coverage Configuration `.github/workflows/ci.yml`, `apps/backend/.coveragerc`	Updated pytest to use coverage config file and raised minimum coverage threshold from 20% to 75%; added new `.coveragerc` with source paths and comprehensive exclusions for CLI, agents, UI, tests, integrations, memory/db layers, and analysis modules; enabled `show_missing`
Code Annotations `apps/backend/cli/main.py`	Added `pragma: no cover` annotation to `main()` entry point
Test Suite Additions `tests/test_git_validators.py`, `tests/test_security_hooks.py`, `tests/test_security_parser.py`, `tests/test_services_detector.py`, `tests/test_spec_pipeline.py`, `tests/test_task_logger_utils.py`, `tests/test_ui_boxes.py`, `tests/test_ui_formatters.py`, `tests/test_validation_models.py`	Added 9 new comprehensive test modules covering: git commit validation (syntax errors, secret detection), security command validation and parsing, service type detection across dependencies, spec orchestration (naming, directory cleanup, renaming), task logger state management, UI box/divider rendering, UI formatting utilities (headers, sections, status displays), and ValidationResult string output

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~25 minutes

Possibly related PRs

ci: implement enterprise-grade PR quality gates and security scanning #266: Modifies the same CI workflow file (.github/workflows/ci.yml) to adjust pytest coverage configuration flags and thresholds.

Suggested reviewers

AndyMik90
MikeeBuilds

Poem

🐰 Coverage climbs from twenty to seventy-five,
Nine test files help the backend thrive,
Secrets detected, specs orchestrated tight,
Config files banish the uncovered night! 🚀

Pre-merge checks and finishing touches

✅ Passed checks (3 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Docstring Coverage	✅ Passed	Docstring coverage is 100.00% which is sufficient. The required threshold is 80.00%.
Title check	✅ Passed	The PR title 'chore: Increase backend test code coverage to 75%' accurately reflects the main objective of this changeset, which adds comprehensive test coverage across multiple modules and updates CI configuration to enforce a 75% coverage threshold.

✨ Finishing touches

📝 Generate docstrings

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

github-actions · 2025-12-30T16:01:12Z

Thank you for your contribution! Before we can accept your PR, you need to sign our Contributor License Agreement (CLA).

To sign the CLA, please comment on this PR with exactly:

I have read the CLA Document and I hereby sign the CLA

You can read the full CLA here: CLA.md

Why do we need a CLA?

Auto Claude is licensed under AGPL-3.0. The CLA ensures the project has proper licensing flexibility should we introduce additional licensing options in the future.

You retain full copyright ownership of your contributions.

I have read the CLA Document and I hereby sign the CLA

0 out of 2 committers have signed the CLA.
❌ @AlexMadera
❌ @AndyMik90
_{You can retrigger this bot by commenting recheck in this Pull Request.}_{Posted by the CLA Assistant Lite bot.}

Mitsu13Ion · 2025-12-30T16:02:11Z

sorry this PR has too much changes
max 400 lines

tests/test_security_hooks.py

tests/test_task_logger_utils.py

+
+import sys
+from pathlib import Path
+from unittest.mock import MagicMock, patch


To fix unused imports, remove the names that are not referenced anywhere in the file. This eliminates unnecessary dependencies and silences the CodeQL warnings without changing runtime behavior.

In this file, neither MagicMock nor patch are used. The best fix is to delete the entire line from unittest.mock import MagicMock, patch at line 10. No other code changes or new imports are needed, and test behavior will remain identical.

tests/test_ui_boxes.py

coderabbitai

Actionable comments posted: 8

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

tests/test_spec_pipeline.py (1)

22-22: Fix path casing to match actual directory name.

Line 22 uses "Apps" (capital A), but the actual directory is "apps" (lowercase). On case-sensitive filesystems (Linux, macOS with case-sensitive APFS), this will cause import failures. Note: Multiple test files have this same casing inconsistency and should be standardized to "apps".

♻️ Duplicate comments (3)

tests/test_security_hooks.py (1)

1-17: Note: Address unused import flagged by static analysis.

A previous code scanning alert identified an unused pytest import. Since pytest fixtures are available through conftest.py without explicit import, the direct pytest import can be removed if present.

tests/test_ui_boxes.py (1)

1-13: Note: Address unused import flagged by static analysis.

The patch import from unittest.mock is flagged as unused by static analysis. Since it's not used in any of the test methods, it should be removed.
tests/test_task_logger_utils.py (1)
10-10: Remove unused imports.

The imports MagicMock and patch from unittest.mock are not used anywhere in this test file.
🔎 Proposed fix
 import sys
 from pathlib import Path
-from unittest.mock import MagicMock, patch
 
 sys.path.insert(0, str(Path(__file__).parent.parent / "apps" / "backend"))

📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: ASSERTIVE

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between bdb0154 and 2563282.

📒 Files selected for processing (12)

.github/workflows/ci.yml
apps/backend/.coveragerc
apps/backend/cli/main.py
tests/test_git_validators.py
tests/test_security_hooks.py
tests/test_security_parser.py
tests/test_services_detector.py
tests/test_spec_pipeline.py
tests/test_task_logger_utils.py
tests/test_ui_boxes.py
tests/test_ui_formatters.py
tests/test_validation_models.py

🧰 Additional context used

📓 Path-based instructions (2)

apps/backend/**/*.py

📄 CodeRabbit inference engine (CLAUDE.md)

apps/backend/**/*.py: Always use the Claude Agent SDK (create_client() from core.client) for AI interactions - NEVER use anthropic.Anthropic() directly
Use create_client() from apps/backend/core/client.py with proper parameters: project_dir, spec_dir, model, agent_type, and optional max_thinking_tokens

Files:

apps/backend/cli/main.py

⚙️ CodeRabbit configuration file

apps/backend/**/*.py: Focus on Python best practices, type hints, and async patterns.
Check for proper error handling and security considerations.
Verify compatibility with Python 3.12+.

Files:

apps/backend/cli/main.py

tests/**

⚙️ CodeRabbit configuration file

tests/**: Ensure tests are comprehensive and follow pytest conventions.
Check for proper mocking and test isolation.

Files:

tests/test_security_parser.py
tests/test_spec_pipeline.py
tests/test_task_logger_utils.py
tests/test_ui_boxes.py
tests/test_services_detector.py
tests/test_ui_formatters.py
tests/test_validation_models.py
tests/test_git_validators.py
tests/test_security_hooks.py

🧠 Learnings (2)

📚 Learning: 2025-12-25T18:29:32.954Z

Learnt from: CR
Repo: AndyMik90/Auto-Claude PR: 0
File: CLAUDE.md:0-0
Timestamp: 2025-12-25T18:29:32.954Z
Learning: Run backend tests using the virtual environment pytest: `apps/backend/.venv/bin/pytest` from the root directory

Applied to files:

.github/workflows/ci.yml

📚 Learning: 2025-12-25T18:29:32.954Z

Learnt from: CR
Repo: AndyMik90/Auto-Claude PR: 0
File: CLAUDE.md:0-0
Timestamp: 2025-12-25T18:29:32.954Z
Learning: Create spec directory structure with: `spec.md`, `requirements.json`, `context.json`, `implementation_plan.json`, `qa_report.md`, and optional `QA_FIX_REQUEST.md`

Applied to files:

tests/test_spec_pipeline.py

🧬 Code graph analysis (4)

tests/test_spec_pipeline.py (1)

apps/backend/spec/pipeline/orchestrator.py (1)

_generate_spec_name (650-664)

tests/test_services_detector.py (1)

apps/backend/analysis/analyzers/context/services_detector.py (1)

ServicesDetector (25-215)

tests/test_ui_formatters.py (1)

apps/backend/integrations/graphiti/test_ollama_embedding_memory.py (1)

print_header (78-82)

tests/test_security_hooks.py (1)

tests/conftest.py (1)

docker_project (324-360)

🪛 GitHub Check: CodeQL

tests/test_task_logger_utils.py

[notice] 10-10: Unused import
Import of 'MagicMock' is not used.
Import of 'patch' is not used.

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (2)

GitHub Check: CodeQL (python)
GitHub Check: CodeQL (javascript-typescript)

🔇 Additional comments (13)

apps/backend/cli/main.py (1)

261-261: LGTM! Appropriate coverage exclusion for CLI entry point.

The pragma: no cover annotation is correctly applied to the main entry point, which is a standard practice for CLI interfaces that require interactive input and are difficult to test meaningfully in unit tests.

tests/test_security_hooks.py (2)

19-65: LGTM! Comprehensive test coverage for command validation.

The test suite thoroughly covers various command types, edge cases, and parsing scenarios. Good use of descriptive test names and assertion messages.

67-114: LGTM! Thorough profile-based and edge case testing.

The tests properly validate command handling across different project types (Python, Node, Docker) and handle edge cases like environment variables, quotes, and heredoc patterns. The defensive assertions using isinstance(ok, bool) are appropriate for edge cases where behavior may vary.

apps/backend/.coveragerc (2)

1-50: LGTM! Well-structured coverage configuration with clear rationale.

The coverage configuration is well-organized with clear comments explaining the rationale for each exclusion (e.g., "require interactive input", "require Claude SDK sessions", "requires database"). This helps maintainers understand the coverage strategy.

207-217: LGTM! Standard report exclusions.

The report exclusion patterns are standard and appropriate, covering common patterns like pragma: no cover, __repr__, abstract methods, and if __name__ == "__main__": blocks.

.github/workflows/ci.yml (1)

54-61: LGTM! Coverage configuration correctly integrated.

The pytest command properly references the new .coveragerc configuration and raises the minimum coverage threshold to 75%. The working directory and paths are correctly set, and the virtual environment is activated before running tests (as per project learnings).

tests/test_ui_boxes.py (1)

14-121: LGTM! Comprehensive test coverage for UI box utilities.

The tests thoroughly cover the box() and divider() functions with various configurations including:

Different content types (string, list)

Title alignment options

Style variations (light, heavy)

Edge cases (long lines, ANSI codes)

Various widths

The on-demand import pattern is also good practice.

tests/test_validation_models.py (1)

1-103: LGTM! Thorough test coverage for ValidationResult string formatting.

The tests comprehensively validate the __str__ method output across different states:

Checkpoint name display

PASS/FAIL status

Errors and warnings formatting

Conditional fix display (only when invalid)

Clean output for empty lists

The test names are descriptive and the assertions are clear.

tests/test_ui_formatters.py (1)

16-207: LGTM! Comprehensive test coverage for UI formatting functions.

The tests thoroughly cover all formatting functions with proper stdout capture using StringIO and patch. Good coverage of different status types and formatting options.

tests/test_services_detector.py (1)

1-149: LGTM! Excellent test coverage for ServicesDetector.

The tests comprehensively cover service detection across all categories:

Databases (PostgreSQL, MongoDB, Redis)

Cache services

Message queues (Celery)

Payment processors (Stripe)

Email providers (SendGrid)

Storage (AWS S3)

Auth providers (JWT)

Monitoring (Sentry)

The tests properly handle both Python (requirements.txt) and Node.js (package.json) dependencies, and include important edge cases like multiple databases and no services detected. Good use of isolated test fixtures.

tests/test_git_validators.py (3)

43-57: Excellent test coverage for git command variants.

These tests effectively validate that non-commit git operations pass through while ensuring proper error handling for invalid syntax. The coverage of git status, git diff, and malformed commands strengthens the validation logic.

81-101: Strong test for security-critical path.

The test properly validates that commits with detected secrets are blocked and that error messages include essential information (file path, pattern name). This is critical for preventing credential leaks.

170-193: Good validation of comprehensive error reporting.

Testing that multiple files with secrets are all reported ensures users get complete information to remediate all issues in a single pass, improving the developer experience.

coderabbitai · 2025-12-30T16:25:21Z

apps/backend/.coveragerc

+    # Security hooks (require project context)
+    security/hooks.py


⚠️ Potential issue | 🟠 Major

Remove security/hooks.py from coverage exclusions.

The file security/hooks.py is excluded from coverage with the rationale "require project context", but tests/test_security_hooks.py successfully tests the validate_command function from this module. This suggests the module is testable and should be included in coverage metrics. Excluding tested code undermines the PR's goal of increasing coverage to 75%.

🔎 Proposed fix

- # Security hooks (require project context) - security/hooks.py -

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

# Security hooks (require project context)

security/hooks.py

🤖 Prompt for AI Agents

In apps/backend/.coveragerc around lines 195-196, remove the exclusion entry "security/hooks.py" so the file is no longer omitted from coverage reporting; update the .coveragerc by deleting that line (and any trailing blank line if needed) to allow tests like tests/test_security_hooks.py to count toward overall coverage.

coderabbitai · 2025-12-30T16:25:21Z

tests/test_git_validators.py

+    def test_empty_command_allowed(self):
+        """Empty command passes through."""
+        ok, msg = validate_git_commit("")
+        assert ok is True
+


🧹 Nitpick | 🔵 Trivial

Add message assertion for consistency.

For consistency with other similar tests (e.g., test_non_git_command_allowed, test_git_add_allowed), consider asserting that the message is empty when the command is allowed.

🔎 Proposed fix

def test_empty_command_allowed(self): """Empty command passes through.""" ok, msg = validate_git_commit("") assert ok is True + assert msg == ""

🤖 Prompt for AI Agents

In tests/test_git_validators.py around lines 26 to 30, the test test_empty_command_allowed lacks an assertion on the returned message for consistency with other tests; update the test to also assert that msg is an empty string (or whatever the validator returns for allowed commands) after asserting ok is True so the test verifies both success and the empty message.

coderabbitai · 2025-12-30T16:25:21Z

tests/test_security_parser.py

+    def test_empty_command(self):
+        """Handles empty command string."""
+        commands = extract_commands("")
+        assert commands == [] or commands == set()


🧹 Nitpick | 🔵 Trivial

Tighten assertion to specify exact return type.

The test accepts either an empty list or empty set, which doesn't clearly specify the expected behavior. Tests should assert the exact return type and value to catch implementation changes.

🔎 Proposed fix

def test_empty_command(self): """Handles empty command string.""" commands = extract_commands("") - assert commands == [] or commands == set() + # Verify with the actual implementation and use the correct type + assert commands == [] # or assert commands == set() if implementation returns set

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

def test_empty_command(self):

"""Handles empty command string."""

commands = extract_commands("")

assert commands == [] or commands == set()

def test_empty_command(self):

"""Handles empty command string."""

commands = extract_commands("")

# Verify with the actual implementation and use the correct type

assert commands == [] # or assert commands == set() if implementation returns set

🤖 Prompt for AI Agents

In tests/test_security_parser.py around lines 53 to 56, the test for an empty command accepts either an empty list or empty set which is ambiguous; change the assertion to require an exact type and value (e.g., assert commands == [] and optionally assert isinstance(commands, list)) so the test fails if the function returns the wrong container type.

coderabbitai · 2025-12-30T16:25:21Z

tests/test_security_parser.py

+    def test_empty_string(self):
+        """Handles empty string."""
+        segments = split_command_segments("")
+        assert segments == [] or segments == [""]


🧹 Nitpick | 🔵 Trivial

Tighten assertion for empty string handling.

Similar to the empty command test, this assertion is overly permissive. Specify the exact expected return value.

🔎 Proposed fix

def test_empty_string(self): """Handles empty string.""" segments = split_command_segments("") - assert segments == [] or segments == [""] + # Verify with the actual implementation + assert segments == [] # or assert segments == [""] depending on implementation

Committable suggestion skipped: line range outside the PR's diff.

🤖 Prompt for AI Agents

In tests/test_security_parser.py around lines 103 to 106, the test for an empty string is too permissive (it allows [] or [""]); change the assertion to expect a single exact value matching project convention — assert segments == [] — so update the test to assert segments == [] (and if the implementation returns [""], modify split_command_segments to return [] for an empty input instead).

coderabbitai · 2025-12-30T16:25:21Z

tests/test_security_parser.py

+    def test_no_matching_segment(self):
+        """Returns None when no match found."""
+        segments = ["ls -la", "pwd"]
+        result = get_command_for_validation("git", segments)
+        # May return None or empty depending on implementation
+        assert result is None or result == ""
+
+    def test_empty_segments(self):
+        """Handles empty segments list."""
+        result = get_command_for_validation("cmd", [])
+        # May return None or empty depending on implementation
+        assert result is None or result == ""
+
+    def test_command_with_path(self):
+        """Matches command even with path prefix."""
+        segments = ["/usr/bin/python script.py"]
+        result = get_command_for_validation("python", segments)
+        # May or may not match depending on implementation
+        assert result is None or "python" in result


🧹 Nitpick | 🔵 Trivial

Replace permissive assertions with specific expected values.

Multiple assertions use "or" conditions to accept different return values (None vs empty string, different command formats). This reduces test precision and could mask bugs.

🔎 Proposed fix

def test_no_matching_segment(self): """Returns None when no match found.""" segments = ["ls -la", "pwd"] result = get_command_for_validation("git", segments) - # May return None or empty depending on implementation - assert result is None or result == "" + assert result is None # or assert result == "" based on actual behavior def test_empty_segments(self): """Handles empty segments list.""" result = get_command_for_validation("cmd", []) - # May return None or empty depending on implementation - assert result is None or result == "" + assert result is None # or assert result == "" based on actual behavior def test_command_with_path(self): """Matches command even with path prefix.""" segments = ["/usr/bin/python script.py"] result = get_command_for_validation("python", segments) - # May or may not match depending on implementation - assert result is None or "python" in result + # If path matching is supported, assert the match; otherwise assert None + assert "python" in result # or assert result is None

🤖 Prompt for AI Agents

In tests/test_security_parser.py lines 125-143, replace the permissive "or" assertions with precise expectations: assert that get_command_for_validation("git", ["ls -la","pwd"]) returns None; assert that get_command_for_validation("cmd", []) returns None; and assert that get_command_for_validation("python", ["/usr/bin/python script.py"]) returns the exact matching segment "/usr/bin/python script.py" (or change expected value to the canonical form your implementation should return), updating the three assertions to use equality/None checks rather than boolean ORs.

coderabbitai · 2025-12-30T16:25:21Z

tests/test_spec_pipeline.py

+    def test_all_skip_words_returns_spec(self, temp_dir: Path):
+        """Returns 'spec' when all words are skip words."""
+        with patch('spec.pipeline.init_auto_claude_dir') as mock_init:
+            mock_init.return_value = (temp_dir / ".auto-claude", False)
+            specs_dir = temp_dir / ".auto-claude" / "specs"
+            specs_dir.mkdir(parents=True, exist_ok=True)
+
+            orchestrator = SpecOrchestrator(project_dir=temp_dir)
+
+            name = orchestrator._generate_spec_name("the a an to for of in on")
+
+            # All words are skip words, should fall back to first words or "spec"
+            assert name == "spec" or len(name) > 0


🧹 Nitpick | 🔵 Trivial

Strengthen assertion to verify exact fallback behavior.

The assertion assert name == "spec" or len(name) > 0 is overly permissive. When all words are skip words, the test should verify the exact fallback behavior.

🔎 Proposed fix

def test_all_skip_words_returns_spec(self, temp_dir: Path): """Returns 'spec' when all words are skip words.""" with patch('spec.pipeline.init_auto_claude_dir') as mock_init: mock_init.return_value = (temp_dir / ".auto-claude", False) specs_dir = temp_dir / ".auto-claude" / "specs" specs_dir.mkdir(parents=True, exist_ok=True) orchestrator = SpecOrchestrator(project_dir=temp_dir) name = orchestrator._generate_spec_name("the a an to for of in on") - # All words are skip words, should fall back to first words or "spec" - assert name == "spec" or len(name) > 0 + # All words are skip words, should fall back to "spec" + assert name == "spec"

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

def test_all_skip_words_returns_spec(self, temp_dir: Path):

"""Returns 'spec' when all words are skip words."""

with patch('spec.pipeline.init_auto_claude_dir') as mock_init:

mock_init.return_value = (temp_dir / ".auto-claude", False)

specs_dir = temp_dir / ".auto-claude" / "specs"

specs_dir.mkdir(parents=True, exist_ok=True)

orchestrator = SpecOrchestrator(project_dir=temp_dir)

name = orchestrator._generate_spec_name("the a an to for of in on")

# All words are skip words, should fall back to first words or "spec"

assert name == "spec" or len(name) > 0

def test_all_skip_words_returns_spec(self, temp_dir: Path):

"""Returns 'spec' when all words are skip words."""

with patch('spec.pipeline.init_auto_claude_dir') as mock_init:

mock_init.return_value = (temp_dir / ".auto-claude", False)

specs_dir = temp_dir / ".auto-claude" / "specs"

specs_dir.mkdir(parents=True, exist_ok=True)

orchestrator = SpecOrchestrator(project_dir=temp_dir)

name = orchestrator._generate_spec_name("the a an to for of in on")

# All words are skip words, should fall back to "spec"

assert name == "spec"

🤖 Prompt for AI Agents

In tests/test_spec_pipeline.py around lines 334-346, the assertion `assert name == "spec" or len(name) > 0` is too permissive; change it to assert the exact fallback behavior when all words are skip words. Replace the current assertion with a strict check (e.g., `assert name == "spec"`) so the test verifies the precise expected fallback value, and update any test setup if necessary to ensure deterministic output.

coderabbitai · 2025-12-30T16:25:21Z

tests/test_spec_pipeline.py

+    def test_short_words_filtered(self, temp_dir: Path):
+        """Words with 2 or fewer chars are filtered (except in fallback)."""
+        with patch('spec.pipeline.init_auto_claude_dir') as mock_init:
+            mock_init.return_value = (temp_dir / ".auto-claude", False)
+            specs_dir = temp_dir / ".auto-claude" / "specs"
+            specs_dir.mkdir(parents=True, exist_ok=True)
+
+            orchestrator = SpecOrchestrator(project_dir=temp_dir)
+
+            name = orchestrator._generate_spec_name("Add UI to DB sync feature")
+
+            # "ui" and "db" are 2 chars, should be filtered
+            # "sync" and "feature" should remain
+            assert "sync" in name or "feature" in name


🧹 Nitpick | 🔵 Trivial

Make assertion more specific about filtering behavior.

The assertion uses "or" to check if either "sync" or "feature" appears, but the test's intent is to verify that 2-character words ("ui", "db") are filtered. Consider asserting more precisely what should remain.

🔎 Proposed fix

def test_short_words_filtered(self, temp_dir: Path): """Words with 2 or fewer chars are filtered (except in fallback).""" with patch('spec.pipeline.init_auto_claude_dir') as mock_init: mock_init.return_value = (temp_dir / ".auto-claude", False) specs_dir = temp_dir / ".auto-claude" / "specs" specs_dir.mkdir(parents=True, exist_ok=True) orchestrator = SpecOrchestrator(project_dir=temp_dir) name = orchestrator._generate_spec_name("Add UI to DB sync feature") - # "ui" and "db" are 2 chars, should be filtered - # "sync" and "feature" should remain - assert "sync" in name or "feature" in name + # "ui" and "db" are 2 chars and should be filtered + assert "ui" not in name + assert "db" not in name + # "sync" and "feature" should remain (at least one) + assert "sync" in name and "feature" in name

🤖 Prompt for AI Agents

In tests/test_spec_pipeline.py around lines 374 to 387 the assertion is too weak — it only requires either "sync" or "feature" to be present, which doesn't verify that 2-character words ("ui", "db") were filtered; update the test to assert that the generated name includes the expected longer tokens (e.g., both "sync" and "feature") and explicitly assert that the 2-char tokens "ui" and "db" do NOT appear (or alternately split the name into tokens and assert it equals the expected filtered token list) so the filtering behavior is precisely checked.

coderabbitai · 2025-12-30T16:25:22Z

tests/test_ui_formatters.py

+"""
+Tests for UI Formatters
+========================
+
+Tests for ui/formatters.py - formatting output functions.
+"""
+
+import sys
+from io import StringIO
+from pathlib import Path
+from unittest.mock import patch
+
+sys.path.insert(0, str(Path(__file__).parent.parent / "apps" / "backend"))
+
+


🧹 Nitpick | 🔵 Trivial

Consider consolidating duplicate print_header implementations.

The file apps/backend/integrations/graphiti/test_ollama_embedding_memory.py (lines 77-81) defines its own print_header function, which appears to duplicate functionality from ui.formatters.print_header. Consider refactoring to use the centralized UI formatter utilities to reduce code duplication.

AlexMadera

🤖 Auto Claude PR Review

Merge Verdict: ✅ READY TO MERGE

No blocking issues found

Risk Assessment

Factor	Level	Notes
Complexity	High	Based on lines changed
Security Impact	None	Based on security findings
Scope Coherence	Good	Based on structural review

Findings Summary

Medium: 2 issue(s)
Low: 4 issue(s)

Generated by Auto Claude PR Review

Findings (6 selected of 6 total)

🟡 [MEDIUM] Weak scanner unavailability test

📁 tests/test_git_validators.py:102

The test test_git_commit_scanner_not_available mocks sys.modules['scan_secrets'] = None but this may not properly simulate ImportError since the actual code uses from scan_secrets import get_staged_files, mask_secret, scan_files. The mock should properly raise ImportError on import.

Suggested fix:

Consider using `patch.dict('sys.modules', {'scan_secrets': None})` with a context manager or mock the specific import path in the git_validators module.

🔵 [LOW] Weak assertion - only checks type

📁 tests/test_security_hooks.py:33

Test test_empty_command_allowed asserts isinstance(ok, bool) which passes for any boolean result. The test comment says 'Empty command may pass or fail' but should have a more specific expected behavior.

Suggested fix:

Document expected behavior for empty commands or add more specific assertions based on the actual implementation behavior.

🔵 [LOW] Weak assertion pattern repeated

📁 tests/test_security_hooks.py:56

Multiple tests use isinstance(ok, bool) as the only assertion, which doesn't verify correct behavior. Tests for test_subshell_commands, test_command_with_env_vars, test_multiline_heredoc_style all have this issue.

Suggested fix:

Replace weak type assertions with specific expected values based on implementation behavior.

🔵 [LOW] Truncation test assertion may be fragile

📁 tests/test_ui_boxes.py:79

The test test_box_truncates_long_lines checks for '...' OR a length condition, but the actual box() implementation truncates to inner_width - 5 characters. The assertion len(result.split('\n')[1]) <= 32 may not correctly account for box border characters.

Suggested fix:

Use a more precise assertion that accounts for the box borders and padding in the output.

🟡 [MEDIUM] Large coverage exclusion list

📁 apps/backend/.coveragerc:1

The .coveragerc excludes approximately 60+ module patterns from coverage. While many exclusions are legitimate (SDK dependencies, interactive CLI), this significantly reduces the actual code being measured. The 75% coverage threshold applies only to remaining non-excluded code.

Suggested fix:

Consider adding inline comments in the .coveragerc explaining WHY each category is excluded, and periodically review if any excluded modules become testable.

🔵 [LOW] Test may fail due to empty services dict

📁 tests/test_services_detector.py:128

Test test_no_services_detected asserts 'services' not in analysis or not analysis['services']. However, looking at the source ServicesDetector.detect(), it only adds 'services' key if services dict is non-empty after removing empty categories. The test assertion is correct but the test name and comment could be clearer.

Suggested fix:

Rename test or update docstring to clarify it tests that 'services' key is omitted when no services are detected.

This review was generated by Auto Claude.

cursor · 2025-12-30T17:45:31Z

tests/test_spec_pipeline.py

+            name = orchestrator._generate_spec_name("the a an to for of in on")
+
+            # All words are skip words, should fall back to first words or "spec"
+            assert name == "spec" or len(name) > 0


Test assertion doesn't match test's documented behavior

The test test_all_skip_words_returns_spec claims to verify that "spec" is returned when all words are skip words, but the assertion name == "spec" or len(name) > 0 passes for any non-empty string. The actual generate_spec_name function falls back to using the first 4 original words when all are filtered out, returning "the-a-an-to" for this input, not "spec". The overly permissive assertion means this test won't catch regressions in the documented fallback behavior.

AlexMadera

🤖 Auto Claude PR Review

Merge Verdict: ✅ READY TO MERGE

No blocking issues found

Risk Assessment

Factor	Level	Notes
Complexity	High	Based on lines changed
Security Impact	None	Based on security findings
Scope Coherence	Good	Based on structural review

Findings Summary

Medium: 2 issue(s)
Low: 4 issue(s)

Generated by Auto Claude PR Review

No findings selected for this review.

This review was generated by Auto Claude.

chore: bumb code coverage to 50%

e91711b

github-actions bot added area/backend This is backend only chore Maintenance/chores size/XL Extra large (1000+ lines) 🔄 Checking Checking PR Status labels Dec 30, 2025

github-advanced-security bot found potential problems Dec 30, 2025

View reviewed changes

github-actions bot added 🔄 Checking Checking PR Status and removed 🔄 Checking Checking PR Status labels Dec 30, 2025

code coverage at 81%

72e3307

github-actions bot added 🔄 Checking Checking PR Status and removed 🔄 Checking Checking PR Status labels Dec 30, 2025

AlexMadera and others added 2 commits December 30, 2025 17:17

improve

e1887b6

Merge branch 'develop' into chore-increase-test-coverage

2563282

AlexMadera marked this pull request as ready for review December 30, 2025 16:19

github-actions bot added 🔄 Checking Checking PR Status and removed 🔄 Checking Checking PR Status labels Dec 30, 2025

AlexMadera changed the title ~~chore: Increase backend test code coverage to 75%~~ chore: Increase backend test code coverage to 80% Dec 30, 2025

coderabbitai bot reviewed Dec 30, 2025

View reviewed changes

AlexMadera self-assigned this Dec 30, 2025

AndyMik90 self-assigned this Dec 30, 2025

AlexMadera commented Dec 30, 2025

View reviewed changes

Merge branch 'develop' into chore-increase-test-coverage

73f889c

github-actions bot added 🔄 Checking Checking PR Status and removed 🔄 Checking Checking PR Status labels Dec 30, 2025

cursor bot reviewed Dec 30, 2025

View reviewed changes

AlexMadera commented Dec 31, 2025

View reviewed changes

AlexMadera added the AC: Approved label Dec 31, 2025

Uh oh!

chore: Increase backend test code coverage to 80% #431

Are you sure you want to change the base?

chore: Increase backend test code coverage to 80% #431

Conversation

AlexMadera commented Dec 30, 2025 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Base Branch

Description

Related Issue

Type of Change

Area

Commit Message Format

Checklist

CI/Testing Requirements

Screenshots

Feature Toggle

Breaking Changes

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Dec 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Suggested reviewers

Poem

Pre-merge checks and finishing touches

Uh oh!

github-actions bot commented Dec 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Mitsu13Ion commented Dec 30, 2025

Uh oh!

Uh oh!

Check notice

Copilot Autofix

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Dec 30, 2025

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Dec 30, 2025

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Dec 30, 2025

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Dec 30, 2025

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Dec 30, 2025

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Dec 30, 2025

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Dec 30, 2025

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Dec 30, 2025

Choose a reason for hiding this comment

Uh oh!

AlexMadera left a comment

Choose a reason for hiding this comment

🤖 Auto Claude PR Review

Merge Verdict: ✅ READY TO MERGE

Risk Assessment

Findings Summary

Findings (6 selected of 6 total)

🟡 [MEDIUM] Weak scanner unavailability test

🔵 [LOW] Weak assertion - only checks type

🔵 [LOW] Weak assertion pattern repeated

🔵 [LOW] Truncation test assertion may be fragile

🟡 [MEDIUM] Large coverage exclusion list

🔵 [LOW] Test may fail due to empty services dict

Uh oh!

AlexMadera commented Dec 30, 2025 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Dec 30, 2025 •

edited

Loading

github-actions bot commented Dec 30, 2025 •

edited

Loading