WASM Implementation Plan for Scam Detector

Reviewed 2026-02-02: API verified against fasttext.wasm.js v1.0.0 source. Fixed Pair access ([0]/[1] not .first/.second).

Updated 2026-02-02: Applied 4 fixes from pi review: (1) scan predictions for __label__scam explicitly, (2) call predictions.delete() to avoid WASM heap leak, (3) manifest pattern fastText/** for subfolders, (4) filename case verified correct.

Overview

Load the trained fastText model (quant-cutoff100k.ftz, 5.72MB) in the browser extension using WebAssembly.

Recommended Library

fasttext.wasm.js (v1.0.0)

✅ TypeScript support
✅ Explicit browser extension support
✅ Custom model loading
✅ Active maintenance (2024)
✅ ~423KB WASM binary (very reasonable!)

Architecture

extension/
├── vendor/
│   └── fasttext/                   # fasttext.wasm.js assets
├── fastText/
│   └── fastText.common.wasm        # WASM binary
├── models/
│   └── quant-cutoff100k.ftz        # Our model (5.72MB)
├── background.js                   # Background service worker
├── content-script.js               # Content script
└── manifest.json

Implementation Steps

Phase 1: Dependencies & Setup

cd extension
npm install fasttext.wasm.js

Phase 2: Extract WASM Assets

Copy from node_modules/fasttext.wasm.js:

dist/core/fastText.common.wasm → public/fastText/fastText.common.wasm

Or use a build script to copy automatically:

mkdir -p extension/public/fastText extension/models
cp node_modules/fasttext.wasm.js/dist/core/fastText.common.wasm extension/public/fastText/
cp models/reduced/quant-cutoff100k.ftz extension/models/quant-cutoff100k.ftz

WASM size: 423KB
Model size: 5.72MB
Total: ~6.1MB (still fine for a browser extension)

Phase 3: Create ScamDetector Wrapper

src/lib/scam-detector.ts:

import {
  getFastTextModule,
  getFastTextClass,
  type FastTextModel,
} from "fasttext.wasm.js/common";

const PRODUCTION_THRESHOLD = 0.6151;

export interface PredictionResult {
  isScam: boolean;
  confidence: number;
  label: string;
  rawProbability: number;
}

export class ScamDetector {
  private model: FastTextModel | null = null;
  private loaded = false;

  async load(options?: {
    wasmPath?: string;
    modelPath?: string;
  }): Promise<void> {
    if (this.loaded) return;

    const wasmPath =
      options?.wasmPath ??
      chrome.runtime.getURL("fastText/fastText.common.wasm");
    const modelPath =
      options?.modelPath ??
      chrome.runtime.getURL("models/quant-cutoff100k.ftz");

    // Step 1: Initialize the WASM module with custom path
    const getFastTextModuleWithPath = () => getFastTextModule({ wasmPath });

    // Step 2: Get the FastText class
    const FastText = await getFastTextClass({
      getFastTextModule: getFastTextModuleWithPath,
    });

    // Step 3: Load the model
    const ft = new FastText();
    this.model = await ft.loadModel(modelPath);
    this.loaded = true;
  }

  async predict(
    text: string,
    threshold = PRODUCTION_THRESHOLD,
  ): Promise<PredictionResult> {
    if (!this.model) {
      throw new Error("Model not loaded. Call load() first.");
    }

    // fastText predict returns Vector<Pair<number, string>> where Pair is a tuple [number, string]:
    // - [0] is probability
    // - [1] is label (e.g., "__label__scam")
    // Use -1 to get ALL labels (don't assume binary normalized probabilities!)
    const predictions = this.model.predict(text, -1);

    try {
      // Scan predictions for __label__scam explicitly
      // (fastText doesn't guarantee p(scam)+p(not_scam)=1)
      let scamProb = 0;
      for (let i = 0; i < predictions.size(); i++) {
        const pair = predictions.get(i);
        const prob = pair[0];
        const label = pair[1];
        if (label === "__label__scam") {
          // Clamp to [0, 1]: quantized models can overshoot 1 slightly
          scamProb = Math.min(1, Math.max(0, prob));
          break;
        }
      }

      return {
        isScam: scamProb >= threshold,
        confidence: scamProb >= threshold ? scamProb : 1 - scamProb,
        label: scamProb >= threshold ? "scam" : "not_scam",
        rawProbability: scamProb,
      };
    } finally {
      // IMPORTANT: Free embind Vector to avoid WASM heap leak
      predictions.delete();
    }
  }

  isLoaded(): boolean {
    return this.loaded;
  }
}

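// Singleton for the background script. The promise is cached (not the
// instance) so concurrent callers all wait for the same load to finish.
let instancePromise: Promise<ScamDetector> | null = null;

export function getScamDetector(): Promise<ScamDetector> {
  if (!instancePromise) {
    const detector = new ScamDetector();
    instancePromise = detector.load().then(
      () => detector,
      (error) => {
        instancePromise = null; // Allow a retry on the next call
        throw error;
      },
    );
  }
  return instancePromise;
}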

Note: The predict() method returns a C++ Vector wrapper. Access elements with .get(i). Pair types are tuples [T1, T2] - access with [0]/[1], NOT .first/.second! IMPORTANT: Call predictions.delete() after use to free WASM heap memory!
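
The access pattern in the note above can be isolated into a small helper that is testable without the WASM runtime. This is a minimal sketch: `PredictionVector`, `readLabelProbability`, and `makeFakeVector` are hypothetical names, and the interface only mirrors the `.get(i)` / `.size()` / `.delete()` shape described here rather than the library's exported types.

```typescript
// Structural stand-in for the embind Vector<Pair<number, string>> described
// above. Hypothetical type: it mirrors the shape, not the library's own types.
interface PredictionVector {
  size(): number;
  get(index: number): [number, string];
  delete(): void;
}

// Returns the probability reported for `wantedLabel`, or 0 if it is absent.
// The vector is always freed, even if reading it throws.
function readLabelProbability(
  predictions: PredictionVector,
  wantedLabel: string,
): number {
  try {
    for (let i = 0; i < predictions.size(); i++) {
      const [probability, label] = predictions.get(i);
      if (label === wantedLabel) return probability;
    }
    return 0;
  } finally {
    predictions.delete();
  }
}

// In-memory fake for unit tests; records whether delete() was called.
function makeFakeVector(pairs: Array<[number, string]>) {
  const state = { deleted: false };
  const vector: PredictionVector = {
    size: () => pairs.length,
    get: (index) => {
      const pair = pairs[index];
      if (!pair) throw new RangeError(`index ${index} out of range`);
      return pair;
    },
    delete: () => {
      state.deleted = true;
    },
  };
  return { vector, state };
}
```

Keeping the `delete()` call inside the helper means callers cannot forget it, and the fake makes the cleanup itself something a test can assert on.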

Phase 4: Background Script Integration

src/background.ts:

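import { getScamDetector } from "./lib/scam-detector";

// Warm the model on install/update and on browser startup.
// The message handler below also loads it lazily, so a failure here is not fatal.
async function warmUp(): Promise<void> {
  try {
    console.log("[ScamDetector] Initializing...");
    await getScamDetector();
    console.log("[ScamDetector] Model loaded!");
  } catch (error) {
    console.error("[ScamDetector] Model failed to load:", error);
  }
}

chrome.runtime.onInstalled.addListener(warmUp);
chrome.runtime.onStartup.addListener(warmUp);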

// Message handler for content scripts
chrome.runtime.onMessage.addListener((message, sender, sendResponse) => {
  if (message.type === "CHECK_SCAM") {
    (async () => {
      try {
        const detector = await getScamDetector();
        const result = await detector.predict(message.text);
        sendResponse({ success: true, result });
      } catch (error) {
        sendResponse({ success: false, error: String(error) });
      }
    })();
    return true; // Keep channel open for async response
  }
});
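
Messages reach the handler untyped, so it may be worth validating them before calling `predict()`. The following is a sketch of one way to do that: `CheckScamMessage` and `isCheckScamMessage` are hypothetical names, and the shape is assumed from the `{ type: "CHECK_SCAM", text }` object used in this plan.

```typescript
// Hypothetical message shape, matching the object the content script sends.
interface CheckScamMessage {
  type: "CHECK_SCAM";
  text: string;
}

// Type guard: true only for well-formed CHECK_SCAM messages.
function isCheckScamMessage(message: unknown): message is CheckScamMessage {
  if (typeof message !== "object" || message === null) return false;
  const candidate = message as Record<string, unknown>;
  return (
    candidate["type"] === "CHECK_SCAM" && typeof candidate["text"] === "string"
  );
}
```

In the listener, `if (!isCheckScamMessage(message)) return;` would then replace the bare `message.type` check and narrow `message.text` to `string`.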

Phase 5: Content Script Usage

src/content.ts (example):

async function checkText(text: string) {
  const response = await chrome.runtime.sendMessage({
    type: "CHECK_SCAM",
    text,
  });

  if (response.success && response.result.isScam) {
    console.warn("🚨 Potential scam detected:", response.result);
    // Show warning UI
  }
}
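
The decision in `checkText` can be pulled into a pure function so it is testable without `chrome.runtime`. A minimal sketch, assuming the response shapes sent by the background handler above: `ScamResult` mirrors `PredictionResult` from Phase 3, while `CheckScamResponse` and `shouldWarn` are hypothetical names.

```typescript
// Mirrors PredictionResult from the wrapper in Phase 3.
interface ScamResult {
  isScam: boolean;
  confidence: number;
  label: string;
  rawProbability: number;
}

// The two response shapes the background handler sends back.
type CheckScamResponse =
  | { success: true; result: ScamResult }
  | { success: false; error: string };

// Decides whether to show warning UI. A missing or failed response is
// treated defensively as "no warning" instead of throwing.
function shouldWarn(response: CheckScamResponse | undefined): boolean {
  if (!response || !response.success) return false;
  return response.result.isScam;
}
```

Whether a failed check should stay silent or surface an error is a product decision; this sketch picks the silent option.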

Phase 6: Manifest Configuration

manifest.json additions:

{
  "permissions": [],
  "web_accessible_resources": [
    {
      "resources": ["fastText/**"],
      "matches": ["<all_urls>"]
    }
  ],
  "background": {
    "service_worker": "background.js",
    "type": "module"
  }
}

Files to Create/Modify

| File | Action | Description |
| --- | --- | --- |
| `extension/package.json` | Modify | Add `fasttext.wasm.js` dependency |
| `extension/public/fastText/fastText.common.wasm` | Create | Copy WASM binary |
| `extension/models/quant-cutoff100k.ftz` | Create | Copy our model |
| `extension/src/lib/scam-detector.ts` | Create | ScamDetector wrapper class |
| `extension/src/background.ts` | Modify | Initialize detector, handle messages |
| `extension/manifest.json` | Modify | Add web_accessible_resources |

Configuration

| Parameter | Value | Notes |
| --- | --- | --- |
| Threshold | 0.6151 | Tuned for <=2% FPR on holdout |
| Model | `quant-cutoff100k.ftz` | 5.72MB, quantized |
| WASM | `fastText.common.wasm` | 423KB |

Total bundle size: ~6.1MB - still reasonable for a browser extension.
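
To make the "<=2% FPR" note concrete: the false positive rate at a threshold is the fraction of known non-scam samples whose scam score is at or above it. The sketch below shows that calculation only; `falsePositiveRate` is a hypothetical helper, not the procedure used to tune 0.6151.

```typescript
// Fraction of known non-scam samples that would be flagged at `threshold`.
// `nonScamScores` holds the model's scam probability for each such sample.
function falsePositiveRate(
  nonScamScores: number[],
  threshold: number,
): number {
  if (nonScamScores.length === 0) return 0;
  const falsePositives = nonScamScores.filter(
    (score) => score >= threshold,
  ).length;
  return falsePositives / nonScamScores.length;
}
```

Rerunning this over the holdout set after retraining or requantizing the model is a cheap way to check that the configured threshold still meets its target.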

UI highlights

scam: red outline/background
crypto: orange outline/background
promo: blue outline/background
Label badge shows top labels on highlighted elements; tooltip includes top scores.
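
The label-to-color mapping and the "top labels" badge could be sketched as below. All names here (`ScoredLabel`, `highlightColorFor`, `topLabels`) are hypothetical, and the sketch assumes labels arrive with the fastText `__label__` prefix.

```typescript
interface ScoredLabel {
  label: string; // e.g. "__label__scam"
  probability: number;
}

// Colors from the list above. A Map avoids accidental hits on
// Object.prototype keys such as "toString".
const HIGHLIGHT_COLORS = new Map<string, string>([
  ["scam", "red"],
  ["crypto", "orange"],
  ["promo", "blue"],
]);

// Returns the highlight color for a fastText label, or null if the label
// has no highlight.
function highlightColorFor(fastTextLabel: string): string | null {
  const shortLabel = fastTextLabel.replace(/^__label__/, "");
  return HIGHLIGHT_COLORS.get(shortLabel) ?? null;
}

// Returns the `count` highest-scoring labels, best first, for the badge.
function topLabels(scores: ScoredLabel[], count: number): ScoredLabel[] {
  return [...scores]
    .sort((a, b) => b.probability - a.probability)
    .slice(0, count);
}
```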

Potential Gotchas

Vector/Pair API: fasttext.wasm.js uses C++ STL-like containers. Vector uses .get(i), .size(), and .delete() for cleanup. Pair is a tuple [T1, T2] - access with [0]/[1], NOT .first/.second!
WASM Loading: In a Manifest V3 service worker, resolve the WASM and model paths with chrome.runtime.getURL() rather than relative paths. If the extension's CSP blocks WASM instantiation, the extension_pages policy may need 'wasm-unsafe-eval' in script-src.
Model Path: The loadModel() method expects a URL, not a file path. Use absolute URLs in the browser context.
Memory / Heap Leak: embind objects such as the Vector returned by predict() live on the WASM heap and are not garbage-collected. Free them with .delete(), inside try/finally so cleanup also runs on errors, or every prediction leaks heap memory.
Service Worker Lifecycle: Background service workers can be terminated when idle, which discards module-level state including the loaded model. Load the model lazily on each wake rather than only in onInstalled.
Probability Values: fastText doesn't guarantee p(scam) + p(not_scam) = 1. Don't use 1 - topProb as the complement. Instead, scan predictions for the specific label you want (use predict(text, -1) to get all labels). If you keep multiple scam labels, combine them explicitly.
Quantized overshoot: quantized models may return probabilities slightly > 1. Clamp to [0, 1] before comparisons.
CSP: inline scripts are blocked on extension pages. Load smoke-test JS from an external file (e.g., wasm-smoke.js).
Multi-label: fastText supports multiple labels per sample. At inference, treat each label probability independently and return all labels above your threshold(s).
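
The last few points (scan per label, clamp quantized overshoot, judge each label independently) can be combined into one pure function. This is a sketch under assumptions: `LabelScore`, `clampProbability`, and `labelsAboveThreshold` are hypothetical names, and the per-label thresholds in the usage note are illustrative values, not tuned ones.

```typescript
interface LabelScore {
  label: string;
  probability: number;
}

// Quantized models may report values slightly above 1; clamp to [0, 1].
function clampProbability(probability: number): number {
  return Math.min(1, Math.max(0, probability));
}

// Keeps every label whose clamped probability meets its own threshold.
// Labels without an entry in `thresholds` use `defaultThreshold`.
// Each label is checked independently; no complement (1 - p) is assumed.
function labelsAboveThreshold(
  scores: LabelScore[],
  thresholds: Map<string, number>,
  defaultThreshold: number,
): LabelScore[] {
  return scores
    .map(({ label, probability }) => ({
      label,
      probability: clampProbability(probability),
    }))
    .filter(
      ({ label, probability }) =>
        probability >= (thresholds.get(label) ?? defaultThreshold),
    );
}
```

For example, `labelsAboveThreshold(scores, new Map([["__label__scam", 0.6151]]), 0.5)` would apply the production threshold to the scam label and a placeholder 0.5 to everything else.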

Testing Checklist

Model loads successfully in background script
Prediction returns correct format (check Vector/Pair access)
Threshold of 0.6151 works correctly
Content script can communicate with background
No CORS issues with WASM/model loading
Memory usage is acceptable
Service worker restart handles model reload

Example Predictions

await detector.predict("FREE AIRDROP! Connect wallet now!");
// → { isScam: true, confidence: 0.999, label: 'scam', rawProbability: 0.999 }

await detector.predict("The meeting is scheduled for tomorrow at 3pm");
// → { isScam: false, confidence: 0.95, label: 'not_scam', rawProbability: 0.05 }

References

fasttext.wasm.js
Browser extension example
WXT Framework (optional, for easier extension development)
