You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@@ -308,7 +312,9 @@ <h1 class="title is-1 publication-title">SDAR: Synergy of AutoRegression and Dif
308
312
<h2class="title is-3">TL;DR</h2>
309
313
<divclass="content has-text-justified">
310
314
<p>
311
-
We propose <b>SDAR</b> (<b>S</b>ynergy of <b>D</b>iffusion and <b>A</b>uto<b>R</b>egression), a new language modeling paradigm that synergizes autoregressive and masked discrete diffusion modeling strategies. The model
315
+
We propose <b>SDAR</b> (<b>S</b>ynergy of <b>D</b>iffusion and <b>A</b>uto<b>R</b>egression), a new
316
+
language modeling paradigm that synergizes autoregressive and masked discrete diffusion modeling
317
+
strategies. The model
312
318
series is continuously trained on Qwen3 and has achieved SOTA performance and speed.
We compare the performance of <b>SDAR-30B-A3B-Chat</b> and <b>Qwen3-30B-A3B-AR-SFT</b> under both dynamic and static inference settings. Additionally, we evaluate how varying the threshold in dynamic inference affects speed relative to static inference.
335
+
We compare the performance of <b>SDAR-30B-A3B-Chat</b> and <b>Qwen3-30B-A3B-AR-SFT</b> under both dynamic
336
+
and static inference settings. Additionally, we evaluate how varying the threshold in dynamic inference
337
+
affects speed relative to static inference.
330
338
</p>
331
339
<ul>
332
-
<li>While the static inference speed of SDAR is comparable to that of AR models, its dynamic mode achieves over <b>2×</b> speed-up over its static counterpart with almost no loss in accuracy.</li>
340
+
<li>While the static inference speed of SDAR is comparable to that of AR models, its dynamic mode achieves
341
+
over <b>2×</b> speed-up over its static counterpart with almost no loss in accuracy.</li>
333
342
<li>The speedup effect tends to become more pronounced with increasing model size.</li>
0 commit comments