Releases: tensorflow/serving
Releases · tensorflow/serving
2.19.1
2.19.0
2.19.0-rc0
Major Features and Improvements
- No major features or improvements.
Breaking Changes
- No breaking changes.
Bug Fixes and Other Changes
- Update release notes for 2.18.0-rc0 (commit: a2253e5)
- cleanup: explicitly set use_java_stubby_library (commit: c22da48)
- cleanup: explicitly set use_java_stubby_library (commit: 5f55e76)
- cleanup: explicitly set use_java_stubby_library (commit: b2be8a3)
- Fix invalid argument to proto_library (commit: 36aa4a6)
- Enable serialization of predict response as tensor content. (commit: 1e16551)
- Prepare code for breaking change in Protobuf C++ API. (commit: 9a38b3a)
- Add WaitResponses to PredictStreamedContext Method. (commit: a0cd2ee)
- Update release notes for 2.17.1 (commit: 8bb2af5)
- Add a new field in PredictRequest. (commit: 166dbe2)
- Update release notes for 2.18.0 (commit: 3134130)
- Adds option to return stop tokens to PredictRequest. (commit: 2ece3e4)
- Remove obsolete anyinvocable flag (commit: ffc6796)
- Move
tsl/platform/{cloud,default,windows}
toxla/tsl/platform
(commit: 2c9e66b) - Use hermetic python version. (commit: d0a16cc)
- Allow HTTP Server to bind to a specific address, as opposed to listening on all addresses. (commit: 8d6cc18)
- Add more tracer (commit: 51f1cf6)
- Add a note about net_http being open-sourced as a standalone library (github.com/google/net_http) (commit: 17186e6)
- Updating TensorFlow to latest passing continuous build. (commit: 04b7d4e)
- Add aggregation key. (commit: 4067640)
- Update users of TSL headers and targets to new location in XLA (commit: 3087ed9)
- Remove usage of tensorflow::BlockingCounter. (commit: f86d984)
- Update users of TSL headers and targets to new location in XLA (commit: 1e25043)
- Force to call SavedMode::Run when output_filter is disabled. Note that the
output filter will still be applied at
PostProcessPredictionResultWithoutOutputFilter
. We just don't prune the
graph in such cases. (commit: 39e33a7) - Integrate BatchFunction op rewrite with serving stack (commit: 88ff3dd)
- add missing error_callback when load is cancelled (commit: e6e78ce)
- Add custom_logging_config so different loggers can introduce their own custom log processing logic. (commit: 4d5c006)
- Update version for 2.19.0-rc0 release. (#4077) (commit: 9a8564a)
- Added github.com/openxla/xla/pull/22572 into tensorflow.patch to resolve breakage. (commit: 72a33bd)
- Fix for gcc10 breakage. (commit: 984448f)
- Revert "Fix for gcc10 breakage." (commit: 69e5f6b)
- Mark Tensorflow compatible with Protobuf v30+. (commit: 8bd5eb8)
- Fix for gcc10 breakage, second attempt. (commit: 428fdde)
- Additional fixes for gcc10. (commit: 021555e)
2.18.1
2.17.1
2.18.0
2.18.0-rc0
Major Features and Improvements
- No major features or improvements.
Breaking Changes
- No breaking changes.
Bug Fixes and Other Changes
- Extend GbmcChannel interface to implement redfish channel for TPUs (commit: 683cb64)
- Add tests to validate monitoring states. (commit: fab5c05)
- Disable xnn_enable_avx256vnnigfni (commit: 19f9ccf)
- Reduce duplicate code using a test class (commit: 51cf3a7)
- Define an option to specify different IFRT client. (commit: aca5cfa)
- Add release notes for tf-serving 2.17.0 (commit: b72a86e)
- avoid SetNumLoadThreads stall the server by forcing reset ThreadPool (commit: 6b9cf7c)
- Add max_enqueued_batches option for model servers (commit: 7c99259)
- Remove gpr_set_log_verbosity from grpc_client.cc (commit: 6e05a38)
- Add option to stop retrying on permanent loading errors. (commit: 9ba72fa)
- Add the batch_padding_policy attribute the tensorflow serving api. (commit: ea02141)
- Improve handling of large JSON objects. (commit: 6cb0131)
- Silence warnings from external code (commit: 010d61a)
- Migration of the histogram header and cc code for TSL. Move tsl/lib/histogram to compiler/tsl/lib/histogram and update users. (commit: ab33df4)
- Add hermetic CUDA repository rule calls to TF serving project. (commit: 787c85f)
- Update users of
status_test_util
to use the new location inxla/tsl
(commit: 22b2b1e) - Bump Bazel version from 6.4.0 to 6.5.0. (commit: 82e532f)
- provide an option to customize the sort order among servable names (commit: 32a85a8)
- Remove cc_api_version stage 4: deletion where cc_api_version = 2 (commit: 7e0c196)
- Remove cc_api_version stage 4: deletion where cc_api_version = 2 (commit: 48e0f56)
- This is a noop comment update for streaming inputs. (commit: cfac240)
- Add a resource kind for number of LoRA models. (commit: 6b7ba27)
- Disable more warnings to make logs cleaner (commit: 4a830ca)
- Add
bool return_single_response
field toPredictStreamedOptions
. (commit: 648c9ee) - Use gcc-10 to avoid build issues while building XLA on CI (commit: 8bd1fda)
- Create separate
kokoro
config (commit: dbc7681) - Remove top-level .bazelrc settings now that scripts use
--config=kokoro
(commit: f920b98) - Update Dockerfile.devel to build with gcc-10 (commit: f9c0262)
- Move
tsl/lib/monitoring
toxla/tsl/lib/monitoring
(commit: cb934df) - Delete 'enable_lazy_split', since the flag is not used anywhere. The code paths for the above flag being false are retained and true are eliminated. This will ensure that improving batching will be easier. (commit: 873993f)
- BUILD rule fix. (commit: d89b272)
- Automated Code Change (commit: 4decd0a)
- Automated Code Change (commit: 0b05e86)
- Fix build error (commit: d341c34)
- Added capability to use XLA on a GPU. (commit: e5e795f)
- Update version for 2.18.0-rc0 release. (#2258) (commit: d6d4022)
- Mark Tensorflow compatible with Protobuf v26+. (#2261) (commit: 424dba4)
- Update version for 2.18.0-rc0 release. (#2262) (commit: 67f4ee8)
- This release is based on TF version 2.18.0-rc2.
2.17.0
Major Features and Improvements
- No major features or improvements.
Breaking Changes
- No breaking changes.
Bug Fixes and Other Changes
- Add RequestOptions and DeterministicMode options. (commit: a8b200b)
- Remove usages of bridge fallback. (commit: 98570a6)
- Provide a runtime option to lower bound the number of batch threads. (commit: 50b07e4)
- Avoid GetChildren when using Specific servable versions (commit: 6fb9403)
- Add python clif target for prediction_log.proto. (commit: 39ba623)
- Build with --xnn_enable_avx512amx=false (commit: f6c4219)
- Update comment in tfrt_saved_model_factory.h for wrong param name. (commit: 14ce911)
- Upgraded libevent to 2.1.12. Fixed minor bug in EvHTTPServer. (commit: 2cda80a)
- Introduce RequestRecorder in tfrt_servable so that implementation can record customized costs and metrics. (commit: 749007b)
- Integrate TFRT+IFRT with tensorflow serving (commit: a8b64dd)
- Add core selector support for TFRT+IFRT serving on tensorflow serving (commit: 84a71a4)
- Remove GPR_ASSERT . (commit: 2dca3af)
- Add timeout support when waiting on servables to load. (commit: 093d841)
- Build with --xnn_enable_avx512fp16=false (commit: eeac086)
- Support paging in TfrtSavedModelServable. (commit: 993a53c)
- Add max_enqueued_batches option for model servers. (commit: d914192)
- Add max_enqueued_batches option for model servers. (commit: 67a2dcb)
- Update version for 2.17.0 release. (#2225) (commit: 68eda92)
- Include patch files necessary for building at TF 2.17 (commit: 6311b72)
- This release is based on TF version 2.17.0.