Commit 12fbfc1
fix(community/og): 微信 msg_cdn_url 兜底 + http→https + body cap 16MB (#33)
* fix(community/og): 微信 msg_cdn_url 兜底 + http→https 升级 + body cap 16MB
## 病根
线上 13 条 APPROVED 分享里只有 4 条能正常显示封面:
- 5/7 公众号 og_cover NULL(公众号 head 没 og:image,封面图埋在
inline script 的 var msg_cdn_url JS 变量里,Jsoup meta 选择器扫不到)
- 2/2 小红书 og_cover 是 http://sns-webpic-qc.xhscdn.com/...
(HTTPS 页面被浏览器 mixed-content policy 拦掉)
- id=20 那条公众号触发 "response body exceeded max size"
(head 之前 inline base64 logo + 编辑器 JSON 超过原 8MB 上限)
## 修复
OgFetchService.parseOg 的 cover 查找顺序:
og:image -> twitter:image -> findWeixinCover -> upgradeMediaProtocol
- findWeixinCover: 正则扫 var msg_cdn_url / cdn_url_1_1 / msg_cover_url
强约束开头必须是 http(s):// 防 XSS 注入
- upgradeMediaProtocol: http:// 盲升 https://(mmbiz / xhscdn / zhimg
三大图床都同时支持 https)
- MAX_BODY_BYTES 8MB -> 16MB
前端 sanitizeMediaUrl 加 defense-in-depth 的 http -> https 升级,
万一历史数据 / LLM 兜底回填漏了 https 前端再升一次。
## 测试
OgFetchServiceTests 新增 4 个用例覆盖 WeChat fallback + 协议升级 + XSS 边界,
10/10 通过。
## 历史数据回填
docs/community/og-cover-fallbacks.md 写了三种回填方式(SQL 直升 /
admin refetch API / CommandLineRunner),本次 7 条用 SQL+API 即可。
* fix(community/og)!: WeChat 正则改成数组按优先级顺序匹配(CR PR#33)
Copilot CR 指出原 alternation 正则 `(msg_cdn_url|cdn_url_1_1|msg_cover_url)`
+ Matcher.find() 返回 HTML 文档顺序里**最早出现**的变量,与注释声明的
"msg_cdn_url 优先" 不一致;微信模板偶尔把 cdn_url_1_1 排在前面就会选错。
改成 Pattern[] 数组按优先级顺序逐个 find(),第一个命中即返回。补
findWeixinCover_priorityIndependentOfDocumentOrder 测试锁定行为。
顺手修 docs/community/og-cover-fallbacks.md "两种回填方式" 与下文 A/B/C
三个方式不一致的描述。
Co-authored-by: copilot-pull-request-reviewer[bot] <copilot-pull-request-reviewer[bot]@users.noreply.github.com>
---------
Co-authored-by: copilot-pull-request-reviewer[bot] <copilot-pull-request-reviewer[bot]@users.noreply.github.com>1 parent 48853dd commit 12fbfc1
4 files changed
Lines changed: 329 additions & 8 deletions
File tree
- docs/community
- src
- main/java/com/involutionhell/backend/community/service
- test/java/com/involutionhell/backend/community/service
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
| 37 | + | |
| 38 | + | |
| 39 | + | |
| 40 | + | |
| 41 | + | |
| 42 | + | |
| 43 | + | |
| 44 | + | |
| 45 | + | |
| 46 | + | |
| 47 | + | |
| 48 | + | |
| 49 | + | |
| 50 | + | |
| 51 | + | |
| 52 | + | |
| 53 | + | |
| 54 | + | |
| 55 | + | |
| 56 | + | |
| 57 | + | |
| 58 | + | |
| 59 | + | |
| 60 | + | |
| 61 | + | |
| 62 | + | |
| 63 | + | |
| 64 | + | |
| 65 | + | |
| 66 | + | |
| 67 | + | |
| 68 | + | |
| 69 | + | |
| 70 | + | |
| 71 | + | |
| 72 | + | |
| 73 | + | |
| 74 | + | |
| 75 | + | |
| 76 | + | |
| 77 | + | |
| 78 | + | |
| 79 | + | |
| 80 | + | |
| 81 | + | |
| 82 | + | |
| 83 | + | |
| 84 | + | |
| 85 | + | |
| 86 | + | |
| 87 | + | |
| 88 | + | |
| 89 | + | |
| 90 | + | |
| 91 | + | |
| 92 | + | |
| 93 | + | |
| 94 | + | |
| 95 | + | |
| 96 | + | |
| 97 | + | |
| 98 | + | |
| 99 | + | |
| 100 | + | |
| 101 | + | |
| 102 | + | |
| 103 | + | |
| 104 | + | |
| 105 | + | |
| 106 | + | |
| 107 | + | |
| 108 | + | |
| 109 | + | |
| 110 | + | |
| 111 | + | |
| 112 | + | |
| 113 | + | |
Lines changed: 97 additions & 8 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
21 | 21 | | |
22 | 22 | | |
23 | 23 | | |
| 24 | + | |
| 25 | + | |
24 | 26 | | |
25 | 27 | | |
26 | 28 | | |
| |||
102 | 104 | | |
103 | 105 | | |
104 | 106 | | |
105 | | - | |
106 | | - | |
107 | | - | |
| 107 | + | |
| 108 | + | |
| 109 | + | |
| 110 | + | |
| 111 | + | |
| 112 | + | |
| 113 | + | |
108 | 114 | | |
109 | 115 | | |
110 | 116 | | |
111 | 117 | | |
112 | 118 | | |
113 | 119 | | |
114 | 120 | | |
115 | | - | |
| 121 | + | |
116 | 122 | | |
117 | 123 | | |
118 | 124 | | |
| |||
440 | 446 | | |
441 | 447 | | |
442 | 448 | | |
| 449 | + | |
| 450 | + | |
| 451 | + | |
| 452 | + | |
| 453 | + | |
| 454 | + | |
| 455 | + | |
| 456 | + | |
| 457 | + | |
| 458 | + | |
| 459 | + | |
| 460 | + | |
| 461 | + | |
| 462 | + | |
| 463 | + | |
| 464 | + | |
| 465 | + | |
| 466 | + | |
| 467 | + | |
| 468 | + | |
| 469 | + | |
| 470 | + | |
| 471 | + | |
| 472 | + | |
| 473 | + | |
| 474 | + | |
| 475 | + | |
| 476 | + | |
| 477 | + | |
| 478 | + | |
443 | 479 | | |
444 | 480 | | |
445 | 481 | | |
446 | | - | |
447 | | - | |
| 482 | + | |
| 483 | + | |
| 484 | + | |
| 485 | + | |
| 486 | + | |
| 487 | + | |
448 | 488 | | |
449 | 489 | | |
450 | 490 | | |
| |||
454 | 494 | | |
455 | 495 | | |
456 | 496 | | |
457 | | - | |
| 497 | + | |
458 | 498 | | |
459 | 499 | | |
460 | 500 | | |
461 | 501 | | |
| 502 | + | |
| 503 | + | |
| 504 | + | |
| 505 | + | |
| 506 | + | |
| 507 | + | |
| 508 | + | |
| 509 | + | |
| 510 | + | |
| 511 | + | |
| 512 | + | |
462 | 513 | | |
463 | 514 | | |
464 | 515 | | |
| |||
467 | 518 | | |
468 | 519 | | |
469 | 520 | | |
470 | | - | |
| 521 | + | |
| 522 | + | |
471 | 523 | | |
472 | 524 | | |
473 | 525 | | |
| 526 | + | |
| 527 | + | |
| 528 | + | |
| 529 | + | |
| 530 | + | |
| 531 | + | |
| 532 | + | |
| 533 | + | |
| 534 | + | |
| 535 | + | |
| 536 | + | |
| 537 | + | |
| 538 | + | |
| 539 | + | |
| 540 | + | |
| 541 | + | |
| 542 | + | |
| 543 | + | |
| 544 | + | |
| 545 | + | |
| 546 | + | |
| 547 | + | |
| 548 | + | |
| 549 | + | |
| 550 | + | |
| 551 | + | |
| 552 | + | |
| 553 | + | |
| 554 | + | |
| 555 | + | |
| 556 | + | |
| 557 | + | |
| 558 | + | |
| 559 | + | |
| 560 | + | |
| 561 | + | |
| 562 | + | |
474 | 563 | | |
475 | 564 | | |
476 | 565 | | |
| |||
0 commit comments