Commit 1c1fd2d

Slightly more correct metadata for quantized config.json
1 parent 847a4f7 commit 1c1fd2d

1 file changed: +4 -0 lines changed

conversion/tokenize.py

Lines changed: 4 additions & 0 deletions
@@ -48,6 +48,10 @@ def tokenize(job, save_fn, tokenizer, measure = False):
         cal_tokens = get_tokens(rows, length, cal_ds, tokenizer)
     else:
         cal_tokens = get_standard_calibration(job, measure, tokenizer)
+        if measure:
+            job["measurement_rows"] = cal_tokens.shape[0]
+        else:
+            job["dataset_rows"] = cal_tokens.shape[0]
 
     cal_filename = os.path.join(job["out_dir"], "cal_data.safetensors")
     cal_dict = { "input_ids": cal_tokens }
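
The effect of the four added lines can be sketched in isolation: the row count stored in the job is overwritten with the first dimension of the calibration token tensor, under the key that matches the current pass. This is a minimal illustration, not code from the repository. The helper name `record_row_count`, the `SimpleNamespace` stand-in for the token tensor, and all numeric values are assumptions made for the example.

```python
from types import SimpleNamespace


def record_row_count(job, cal_tokens, measure=False):
    """Hypothetical helper mirroring the added lines: store the actual
    number of calibration rows under the key for the current pass."""
    key = "measurement_rows" if measure else "dataset_rows"
    job[key] = cal_tokens.shape[0]
    return job


# Stand-in for a (rows, length) token tensor; only .shape is used here.
fake_tokens = SimpleNamespace(shape=(115, 2048))

# Illustrative job dict with row counts that differ from the real data.
job = {"measurement_rows": 16, "dataset_rows": 100}

record_row_count(job, fake_tokens, measure=False)
print(job["dataset_rows"], job["measurement_rows"])
```

Only the key for the active pass changes, so a measurement pass and a quantization pass can each record their own row count without overwriting the other.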
