What's Changed
- Adding participation in PyTorch Ecosystem to Readme by @simona-rc in #285
- 🧹 Integrate `nb-clean` by @SagiPolaczek in #287
- added ability to ignore kwargs in function string descriptor building logic, and solved an issue in run_multiprocessed in verbose=0 case by @YoelShoshan in #286
- perplexity impl + post collect func process fix by @mosheraboh in #288
- Make logging optional by @shatz01 in #289
- few fixes to enable pl2 - and limit pl version until all tests will … by @mosheraboh in #291
- Support for pl2 in simple mnist example by @shatz01 in #294
- multiprocessing improvements by @YoelShoshan in #296
- Shatz batch sampler by @shatz01 in #299
- fixed num_available_cores to work correctly in distributed setting in LSF (using cores affinity) by @YoelShoshan in #300
- cindex: make pred np.array to allow negation by @michalozeryflato in #298
- add backbone mvit by @liamhazan in #293
- added an op to replace elements of value X with value Y - supporting numpy array, torch tensors, lists, and strings by @YoelShoshan in #302
- Support Lightning >=2.0.0 and Pandas >=2.0.0 by @SagiPolaczek in #301
- make components more general by @mosheraboh in #305
- `mypy` typing fix by @SagiPolaczek in #304
- resnet 3d with first channels dim != 64 bugfix by @itaijj in #308
- Resnet3d bug fix by @itaijj in #309
- add a flag to metric to avoid multigpu collection by @mosheraboh in #311
- Fix ISIC's ResNet run (Poriya) by @SagiPolaczek in #312
- Prep for long train by @mosheraboh in #313
- small fix. more intuitive condition for main process run by @alex-golts in #315
- update resnet with pool option, added contrastive supervised loss by @shakedpe in #314
- new picai seg by @itaijj in #310
- Shaked fuse by @shakedpe in #317
- cleaning and reorganizing fuse core code on segmentation by @itaijj in #318
- Refactoring: `.flake8` & `.mypy.ini` -> `setup.cfg` by @SagiPolaczek in #319
- multiple validation dataloaders in common lightning module by @mosheraboh in #321
- Remove `pyproject.toml` by @SagiPolaczek in #323
- add an option in CollateDefault to add fixed items to batch_dict by @mosheraboh in #324
- add weight to extract loss class by @mosheraboh in #325
- Shaked fuse by @shakedpe in #327
- OpFunc: input=None passes the entire sample_dict by @michalozeryflato in #326
- Support a better string representation for `NDict` by @SagiPolaczek in #328
- add ignore_index to auc_roc metric by @michalozeryflato in #329
- adding torch_utils with select method similar to numpy by @michalozeryflato in #331
- clearml logger + improved mnist starter by @shatz01 in #330
- Added `__len__` to `PipelineDefault` by @SagiPolaczek in #333
- Bfloat16 support by @mosheraboh in #334
- add a method to add sample ids to a dataset by @michalozeryflato in #332
- OpSetRandomChoice by @mosheraboh in #335
- Make `op_id` a really optional argument in `OpCast` by @SagiPolaczek in #337
- pearson correlation metric by @alex-golts in #338
- Added readthedocs config for documentation by @rakesh9177 in #339
- Make `NDict` (hopefully) lighter to import by @SagiPolaczek in #342
- Support TSV files in `file_io` by @SagiPolaczek in #341
- Add OpReplaceAnyElements to ops_common by @sivanravidos in #340
- Crop padding in collate by @liamhazan in #343
- support pl module test step by @sivanravidos in #344
- Perplexity gpu mem optimization by @mosheraboh in #345
- adds count num sequences and tokens metric by @mosheraboh in #346
- shared memory utility helping to speedup reading from files significantly if enough RAM is available by @YoelShoshan in #347
- Crop pad fix by @IdoAmosIBM in #348
- Bump GitHub workflows' python version to `3.9` by @SagiPolaczek in #350
- add workaround to support fsdp by @mosheraboh in #353
- collate now supports minimal dimension + added more support in samples cacher + added interface validation utility by @YoelShoshan in #349
- MetricCountSeqAndTokens counts tokens in label (if it exists) + improving epoch metrics print by @michalozeryflato in #352
- implement hdf5 dataset reader op by @mosheraboh in #356
- use scipy implementation for Pearson correlation by @alex-golts in #355
- Matching keys lengths and new metrics by @IdoAmosIBM in #358
- min_size_per_dim + changes for scalars inputs+outputs support by @YoelShoshan in #354
- new OpRandCrop for augmentation by @bensha6757 in #360
- Oai example by @liamhazan in #361
- fixed bug of specificity calculation by @ellabarkan in #359
- clearml offline mode by @mosheraboh in #363
- added new metrics for regression tasks by @IdoAmosIBM in #364
- to backward support packages importing metrics from a previous sub module by @YoelShoshan in #365
- move modular tokenizer to fuse by @mosheraboh in #366
- support custom configure_optimizers method and avoid from saving random model by default by @mosheraboh in #367
- avoid from gpu sync attempt when using single gpu by @mosheraboh in #369
- Integrate `ModularTokenizerOp` with Hugging Face remote 🤗 by @SagiPolaczek in #368
- multi test dataloaders by @mosheraboh in #370
- adding r2 metric by @mosheraboh in #371
- Add new tokens to tokenizer op by @matanninio in #373
- rmse score fix by @mosheraboh in #372
- advancing to scalars gen2 by @YoelShoshan in #374
- Tokenizer's default behavior on unknown token by @SagiPolaczek in #375
- Bug fix in `ModularTokenizer.decode()` when the input is `torch.Tensor` type by @SagiPolaczek in #376
- Bump python version for MAMMAL release by @SagiPolaczek in #377
New Contributors
- @shakedpe made their first contribution in #314
- @rakesh9177 made their first contribution in #339
- @sivanravidos made their first contribution in #340
- @IdoAmosIBM made their first contribution in #348
- @bensha6757 made their first contribution in #360
- @matanninio made their first contribution in #373
Full Changelog: 0.3.0...0.4.0