Remove ad network clutter #308

Open
ch040602 wants to merge 1 commit into kepano:main from ch040602:feature/remove-ad-network-clutter
Conversation

ch040602 commented Jun 8, 2026

Adds high-confidence ad network selectors to the existing cleanup pass so extracted content drops common Google ad slots, ad iframes, AMP ads, and sponsored recommendation widgets.

The added coverage focuses on deterministic ad markers such as adsbygoogle, GPT slot IDs, ad data attributes, ad-network iframe hosts, and Taboola/Outbrain/Revcontent widgets, while keeping unrelated words like roadmap and adventure intact.

Validation:
- TZ=UTC npx vitest run tests/ad-removal.test.ts tests/media-removal.test.ts tests/schema-fallback.test.ts
- TZ=UTC npx vitest run tests/fixtures-normalized.temp.test.ts (temporary local check with CRLF/LF normalized)
- npm run build
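For readers who want a feel for what "deterministic ad markers" could mean in practice, here is a minimal sketch. It is not the code from this PR: `ElementInfo`, `isLikelyAd`, and every constant below are hypothetical names, and the marker lists are illustrative assumptions rather than the selectors actually added. The idea it shows is matching whole class tokens, anchored ID prefixes, and parsed iframe hosts instead of searching for the substring "ad", which is why words like roadmap and adventure are left alone. Taboola/Outbrain/Revcontent widgets are omitted because their markup is not shown in this description.

```typescript
// Hypothetical, simplified view of an element; not a type from the project.
interface ElementInfo {
  tag: string;
  id?: string;
  classes?: string[];
  attributes?: Record<string, string>;
}

// Illustrative marker lists (assumptions, not the PR's actual selectors).
const AD_TAGS = ["amp-ad"];
const AD_CLASS_TOKENS = ["adsbygoogle"];
const AD_ID_PREFIX = /^div-gpt-ad-/; // conventional GPT slot ID prefix
const AD_DATA_ATTRIBUTES = ["data-ad-slot", "data-ad-client"];
const AD_IFRAME_HOSTS = ["doubleclick.net", "googlesyndication.com"];

// Extract the host from an absolute or protocol-relative URL, or null.
function hostOf(src: string): string | null {
  const match = /^(?:https?:)?\/\/([^\/:?#]+)/i.exec(src);
  return match ? match[1].toLowerCase() : null;
}

// True when the host equals an ad host or is a subdomain of one.
function isAdHost(host: string): boolean {
  return AD_IFRAME_HOSTS.some((adHost) => {
    const suffix = "." + adHost;
    return host === adHost || host.slice(-suffix.length) === suffix;
  });
}

function isLikelyAd(el: ElementInfo): boolean {
  const tag = el.tag.toLowerCase();
  const attrs = el.attributes || {};

  // Dedicated ad elements such as AMP ads.
  if (AD_TAGS.indexOf(tag) !== -1) return true;

  // Whole-token class match: "adsbygoogle" matches, "roadmap" does not.
  if ((el.classes || []).some((c) => AD_CLASS_TOKENS.indexOf(c) !== -1)) {
    return true;
  }

  // Anchored ID prefix: "div-gpt-ad-123-0" matches, "adventure-map" does not.
  if (el.id !== undefined && AD_ID_PREFIX.test(el.id)) return true;

  // Presence of an ad-specific data attribute.
  if (AD_DATA_ATTRIBUTES.some((name) => name in attrs)) return true;

  // Iframes served from a known ad-network host.
  if (tag === "iframe") {
    const src = attrs["src"];
    const host = src !== undefined ? hostOf(src) : null;
    if (host !== null && isAdHost(host)) return true;
  }

  return false;
}
```

A caller would run a check like this over candidate elements during cleanup and drop the ones that return true; an element with `class="roadmap"` or `id="adventure-map"` falls through every rule and is kept.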
