Releases: biocore/unifrac-binaries
Releases · biocore/unifrac-binaries
Improve subsampling support
Auto-detect CPU capabilities at runtime in Linux
What's Changed
- Tune the build procedure, use AVX2 for performance build by @sfiligoi in #41
- Auto-detect CPU capabilities at runtime in Linux by @sfiligoi in #42
Full Changelog: v1.3.1...v1.3.2
Fix memory leak and improve build procedure
Add support for PERMANOVA, subsampling and the multi mode
Better GPU detection and PCoA-only output
Change default precision and deprecate -n
What's Changed
- Fix static hdf5 linking by @sfiligoi in #15
- Reprecate -n and add --n-substeps by @sfiligoi in #18
- Rename some of the variables to clarify semantics. No change in behavior by @sfiligoi in #19
- Change default method precision to fp32, and add explicit _fp64 equivalent by @sfiligoi in #20
- add citation by @wasade in #16
- Request build on linux-gpu-cuda and cleanup code by @sfiligoi in #22
- Reduce disk usage of the GPU compiler by @sfiligoi in #23
Full Changelog: v1.1.3...v1.2
Bug fix and speed improvements
A buffer overflow bug was fixed.
Moreover, the Unweighted Unifrac has been significantly sped up and there has been additional CPU speedups across the board, too.
What's Changed
- Add precomputed sums for Unweighted + bug fix by @sfiligoi in #13
- Add tiling to speed CPU code by @sfiligoi in #14
Full Changelog: v1.1.2...v1.1.3
CPU optimizations
CPU optimizations yielding 1.5x speedup across the board on CPUs, no change to GPU performance.
What's Changed
- Add CPPFLAGS to R compile by @sfiligoi in #7
- Add support for UNIFRAC_TIMING_INFO env variable by @sfiligoi in #8
- Vectorize Weighted unifrac for CPUs by @sfiligoi in #9
- Add larger unifrac tests by @sfiligoi in #10
- Use plain mkstemp() if O_NOATIME is not available by @jmarshall in #6
- More aggressive vectorization of Weighted by @sfiligoi in #11
- Add zero checking to Unweighted for CPU code by @sfiligoi in #12
New Contributors
- @jmarshall made their first contribution in #6
Full Changelog: v1.1.1...v1.1.2
Address performance regression
The switch from 0.20.3 -> 1.0.0 introduced a 10% performance regression on CPUs by compiling for general targets. This minor release addresses the build change.
Expose in-memory API options
With this release, we introduce the ability to call the API using in-memory objects instead of requiring on-disk representations. We also parallelize a few components of the BIOM constructor.