Conversation


@Jmevorach Jmevorach commented Nov 26, 2025

Refactors the OpenEMR 7.0.5 Docker image for significantly improved startup performance and reduced memory footprint. This is a work-in-progress that requires community testing to verify all container functionality before merging.

Performance Improvements

Benchmarking against the current Docker Hub image (openemr/openemr:7.0.5) shows:

| Metric        | Optimized   | Original    | Improvement   |
|---------------|-------------|-------------|---------------|
| Startup Time  | 15.0s       | 73.1s       | 4.9x faster   |
| Memory (Avg)  | 92.8 MB     | 304.2 MB    | 69% reduction |
| Memory (Peak) | 117.1 MB    | 326.6 MB    | 64% reduction |
| Throughput    | 114.9 req/s | 117.2 req/s | Equivalent    |

Technical Changes

Build-Time Permission Optimization (Primary)

  • Moved file permission operations (chmod/chown) from runtime to build time
  • The original container scanned and modified ~15,000 files on every startup (40-60s)
  • Now permissions are baked into the image; only setup-modified files need runtime adjustment
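
A minimal sketch of the build-time approach, for illustration only; the base image, paths, and ownership shown here are assumptions rather than the exact contents of this PR's Dockerfile:

```dockerfile
# Illustrative only: fix ownership and permissions once, at build time.
FROM alpine:3.20
# ... install Apache/PHP and copy the OpenEMR tree (paths are assumptions) ...
COPY --chown=apache:apache openemr/ /var/www/localhost/htdocs/openemr/
RUN find /var/www/localhost/htdocs/openemr -type d -exec chmod 0755 {} + \
 && find /var/www/localhost/htdocs/openemr -type f -exec chmod 0644 {} +
# At runtime, only files touched by setup (e.g. the site configuration) still need adjusting.
```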

Startup Script Modernization

  • Rewrote openemr.sh from POSIX sh to bash with set -euo pipefail
  • Added comprehensive inline documentation and clear section organization
  • Improved database wait logic with exponential backoff (see the sketch after this list)
  • Simplified permission handling for runtime efficiency
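
For the database wait logic, here is a minimal sketch of an exponential backoff loop; it assumes a MySQL/MariaDB client and a MYSQL_HOST variable are available in the container and is not the exact code in openemr.sh:

```bash
# Sketch only: back off 1s, 2s, 4s ... capped at 30s, up to an overall timeout.
wait_for_database() {
  local delay=1 max_delay=30 elapsed=0 timeout="${1:-300}"
  until mysqladmin ping -h "${MYSQL_HOST}" --silent; do
    if (( elapsed >= timeout )); then
      echo "Database did not become ready within ${timeout}s" >&2
      return 1
    fi
    sleep "${delay}"
    (( elapsed += delay ))
    (( delay = delay * 2 > max_delay ? max_delay : delay * 2 ))
  done
}
```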

Dockerfile Documentation

  • Added extensive comments explaining each build stage
  • Documented package purposes, permission schemes, and build targets

Testing Needed

This PR needs help verifying that all OpenEMR container functions work correctly:

  • Fresh installation (auto-configure)
  • Manual setup mode
  • SSL/TLS certificate configuration
  • Redis session handling
  • Kubernetes mode (admin/worker roles)
  • Swarm mode coordination
  • Upgrade path from previous versions
  • XDebug configuration
  • Let's Encrypt integration
  • Document upload/storage
  • Multi-site configurations

How to Test

Build and test the optimized image:

```bash
cd docker/openemr/7.0.5
docker build -t openemr:7.0.5-optimized .
docker compose up -d
```

Or use the benchmarking utility to compare performance:

```bash
cd utilities/container_benchmarking
./benchmark.sh
```

Related Files

  • docker/openemr/7.0.5/* - Optimized container files
  • utilities/container_benchmarking/ - Benchmarking suite for performance validation

Summary of the Container Benchmarking Utility

The Container Benchmarking Utility (utilities/container_benchmarking/) is a valuable development tool for the OpenEMR project:

What It Does

  • Compares two container images side-by-side (local build vs. Docker Hub reference)
  • Measures key performance metrics:
    • Startup time: How long until the container is healthy
    • Throughput: Requests per second under load (Apache Bench)
    • Resource usage: CPU and memory during operation

Why It's Useful for OpenEMR Container Development

  1. Quantifies optimizations: Instead of guessing, you get hard numbers (e.g., "4.9x faster startup")
  2. Prevents regressions: Compare any changes against the baseline Docker Hub image
  3. Quick feedback loop: Full benchmark suite runs in 3-5 minutes
  4. Reproducible results: Documented methodology with timestamp-organized output
  5. Analysis tools: Includes summary.sh, compare_results.sh, and CSV export for deeper analysis

Recommended Use Cases

  • Before PRs: Verify performance impact of container changes
  • CI/CD integration: Automated regression detection in GitHub Actions
  • Version comparisons: Test across OpenEMR versions (7.0.3, 7.0.4, 7.0.5, etc.)
  • Optimization validation: Prove that changes actually improve performance

The utility democratizes performance testing—any contributor can run benchmarks locally and share quantified results, making it easier to evaluate and merge container improvements.

Fixes shellcheck issues in compare_results.sh

@kojiromike kojiromike left a comment


This is great. I'm looking forward to getting more benchmarking data out of the gate.
I hope you don't mind my reviewing it early despite the WIP marker. I just don't know when I'll get another chance.

Mainly my comments are around taking better advantage of bash. If we're not going to try to be POSIX compliant then we should take full advantage of bash features, particularly integer arithmetic and shopts like nullglob, globstar and maybe extglob.
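
For readers unfamiliar with those features, a tiny illustration (not code from this PR) of the globbing shopts plus built-in integer arithmetic:

```bash
#!/bin/bash
# nullglob: unmatched globs expand to nothing; globstar: ** recurses into subdirectories.
shopt -s nullglob globstar
count=0
for f in ./results/**/benchmark_*.txt; do
  (( count += 1 ))   # integer arithmetic without spawning expr or bc
done
printf 'found %d benchmark result files\n' "${count}"
```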

The other stuff is about taking better advantage of current docker buildtime features like caching.

And maybe it's not a conversation for today, but maybe we don't need to have bash, python and php on the image. Some day, we should consider rewriting these tools in a single language. We can take advantage of Symfony CLI and other libs already included in openemr to avoid adding more overhead or dependency management.

RUN apk add --no-cache build-base \
# Clone OpenEMR repository (shallow clone to reduce image size)
&& git clone https://github.com/openemr/openemr.git --depth 1 \
&& rm -rf openemr/.git \

If we're cloning master and nuking .git, we could fetch the archive from GitHub and unpack it instead.
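
One possible shape of that suggestion (a sketch, not a tested replacement for the existing RUN step), fetching GitHub's auto-generated tarball for the branch and unpacking it:

```dockerfile
# Sketch: download and unpack the source tarball instead of git clone + rm -rf .git
RUN wget -qO- https://github.com/openemr/openemr/archive/refs/heads/master.tar.gz \
      | tar -xz \
 && mv openemr-master openemr
```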

# Install PHP dependencies (production mode, no dev dependencies)
&& composer install --no-dev \
# Install Node.js dependencies for frontend build
&& npm install --unsafe-perm \

npm install/build could be done in a previous stage (docker multistage) and only the assets copied over.
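
A rough multi-stage sketch of that idea; the stage names, Node version, build command, and asset paths are all assumptions, not OpenEMR's actual build layout:

```dockerfile
# Sketch: build frontend assets in their own stage, copy only the output.
FROM node:20-alpine AS assets
WORKDIR /build
COPY package.json package-lock.json ./
RUN npm ci --unsafe-perm
COPY . .
RUN npm run build   # placeholder for whatever the real asset build step is

FROM alpine:3.20 AS final
COPY --from=assets /build/public/assets /var/www/localhost/htdocs/openemr/public/assets
```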

&& cd ../ \
# Clean up vendor and asset files using Phing build tool
&& composer global require phing/phing \
&& /root/.composer/vendor/bin/phing vendor-clean \

Doing composer install in a stage and copying only the needed bits could also reduce the need for manual cleaning applications.
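
Sketch of the same pattern for Composer (paths and the final-stage layout are assumptions):

```dockerfile
# Sketch: resolve PHP dependencies in a throwaway stage, copy only vendor/ into the image.
FROM composer:2 AS php-deps
WORKDIR /build
COPY composer.json composer.lock ./
RUN composer install --no-dev --optimize-autoloader

FROM alpine:3.20 AS final
COPY --from=php-deps /build/vendor /var/www/localhost/htdocs/openemr/vendor
```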

&& composer dump-autoload --optimize --apcu \
# Clear Composer and npm caches to reduce image size
&& composer clearcache \
&& npm cache clear --force \

We should take advantage of docker's support for build-time caching instead of disabling or nuking them at build time.

OpenCoreEMR can share how we're doing this for our own builds some time next week. Cc: @msummers42
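
For reference, the BuildKit feature being alluded to looks roughly like this (requires BuildKit/buildx; the cache paths are assumptions and depend on the base image and tool configuration):

```dockerfile
# Sketch: cache mounts persist package caches across builds without baking them into layers,
# which removes the need for `composer clearcache` or `npm cache clear --force`.
RUN --mount=type=cache,target=/root/.composer/cache \
    composer install --no-dev
RUN --mount=type=cache,target=/root/.npm \
    npm ci --unsafe-perm
```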

# Install kcov dependencies
# Install kcov build dependencies
# kcov is a code coverage tool that requires compilation from source
RUN apk add --no-cache bash \

We have bash above, now.

local files=()
while IFS= read -r file; do
files+=("${file}")
done < <(find "${RESULTS_DIR}" -name "benchmark_*.txt" -type f 2>/dev/null | sort || true)

Suggest using globstar/nullglob here instead of find.
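
Roughly what that would look like (a sketch; ** needs bash with globstar enabled, and glob expansion is already lexically sorted):

```bash
# Sketch: let the shell expand the pattern instead of spawning find | sort
shopt -s nullglob globstar
files=( "${RESULTS_DIR}"/**/benchmark_*.txt )   # empty array when nothing matches
```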


for file in "${recent_files[@]}"; do
local timestamp
timestamp=$(basename "${file}" | sed 's/benchmark_//' | sed 's/\.txt$//')

I suggest using parameter expansion here instead of basename/sed/sed
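
For example, something along these lines (sketch):

```bash
# Sketch: pure parameter expansion, no basename/sed subprocesses
timestamp=${file##*/}               # strip the directory part
timestamp=${timestamp#benchmark_}   # drop the prefix
timestamp=${timestamp%.txt}         # drop the extension
```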

RESULTS_DIR="${RESULTS_DIR:-./results}"

# Colors for output
GREEN='\033[0;32m'

I suggest using tput in all these cases.
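
i.e. something like this (sketch), which degrades gracefully when there is no capable terminal:

```bash
# Sketch: derive colors from terminfo instead of hard-coded escape sequences
GREEN=$(tput setaf 2 2>/dev/null || true)
BLUE=$(tput setaf 4 2>/dev/null || true)
NC=$(tput sgr0 2>/dev/null || true)
```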

NC='\033[0m' # No Color

log_info() {
echo -e "${BLUE}${NC} $*"

I suggest using printf instead of echo -e
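
For instance (sketch): %b expands the backslash escapes currently stored in the color variables; if the colors came from tput instead, a plain %s would do.

```bash
# Sketch: printf behaves consistently across shells, unlike echo -e
log_info() {
  printf '%b %s\n' "${BLUE}${NC}" "$*"
}
```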

extract_metric() {
local file="${1}"
local metric="${2}"
grep "^${metric}=" "${file}" | cut -d'=' -f2 | sed 's/s$//' | sed 's/ms$//' || echo ""

This can be replaced with a single awk command, see above.
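
Something like the following sketch would do it in one process (it also strips a trailing "ms" in one go, which the chained seds don't quite do):

```bash
# Sketch: single awk call instead of grep | cut | sed | sed
extract_metric() {
  awk -F= -v m="${2}" '$1 == m { sub(/m?s$/, "", $2); print $2 }' "${1}"
}
```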


@Jmevorach
Contributor Author

Thank you for taking the time to go through all this, @kojiromike; I really appreciate it. I'll be busy for a little bit, but I should have time in ~2 weeks to start going through all the comments you left and making edits. If anyone wants to step in before then and start making edits themselves, please feel free; I welcome all the help I can get on this PR.

The time from container start to when it passes healthchecks is really important to me, because the shorter that time is, the better our autoscaling will be when running multiple horizontal replicas. If it takes 5 minutes to spin up a new node, it doesn't mean you can't autoscale, but it does mean you won't be able to autoscale as well as if it took 1 minute. There are things you can do with Karpenter and other autoscalers to autoscale performantly even when startup time is relatively long (like keeping pre-spun-up hosts ready to serve traffic rather than spinning up hosts from scratch every time there's an autoscaling event), but reducing the "time from start to healthy" will improve horizontal autoscaling no matter where OpenEMR is run.

While reducing the size of the container image is good in general (especially because it helps reduce the hardware specs you'd need to run OpenEMR using Docker and the time it takes to pull the container), the rise of container caching technologies (specifically for Kubernetes and basically anywhere else you'd host this in a large public cloud provider) means this is something I'm planning to prioritize less than reducing the number of operations the container has to perform when it starts, because I think startup work impacts startup time far more than container image size does.

I feel the same way about reducing the build times of the containers. Reducing build times is never a bad thing (especially because it allows developers to test more easily), but I think that minimizing the operations performed on container start, and performing those operations as quickly as possible (especially when they're the same each time and can be accomplished at Docker build time or by configuring the repositories that build materials are pulled from), is going to give us the greatest "value for effort" for the time being.

I think the fastest way to startup would basically be to ...

  1. Elect a leader.
  2. Have the leader launch multiple parallel processes (as many as we can manage without sacrificing reliability) to complete setup and collect the PIDs of each so that when they complete the leader can make sure they all did so successfully.
  3. As much as possible while waiting for the leader to finish have follower containers also perform their own setup tasks (and parallelize them to the greatest extent possible).
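
A very rough bash sketch of steps 2 and 3, just to make the shape concrete; the task names below are placeholders, not functions that exist in openemr.sh:

```bash
# Sketch: leader runs setup tasks in parallel and verifies each one finished successfully.
configure_database()    { sleep 1; }   # placeholder task
generate_certificates() { sleep 1; }   # placeholder task
warm_caches()           { sleep 1; }   # placeholder task

declare -A pids
for task in configure_database generate_certificates warm_caches; do
  "${task}" &
  pids["${task}"]=$!
done

failed=0
for task in "${!pids[@]}"; do
  if ! wait "${pids[${task}]}"; then
    echo "setup task failed: ${task}" >&2
    failed=1
  fi
done
(( failed == 0 )) || exit 1
```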

What I'd really like to do is introduce automated CI/CD for container builds that verifies that all of this functionality works as it's supposed to:

  • Manual setup mode
  • SSL/TLS certificate configuration
  • Redis session handling
  • Kubernetes mode (admin/worker roles)
  • Swarm mode coordination
  • Upgrade path from previous versions
  • XDebug configuration
  • Let's Encrypt integration
  • Multi-site configurations

I've managed to make a "high availability" docker compose setup ("high availability" is in quotes because even if you have two containers running, running both of them on the same host is not really high availability) that launches two containers behind a load balancer, which can be used to test that Swarm node configuration works. So I'm making some progress in this regard, but, again, I'll happily take all the help I can get on pushing this forward.

The benchmarking is a good first step because it allows us to quickly verify the performance impact of a change to one of the containers, but as long as we have to more or less manually verify that the functionality above works as expected, there's a limit to how fast we'll be able to iterate. By the way, thanks for all your work introducing the unit tests you've added so far. I'm a big fan of what's going on in the "tests" folder in the "openemr/openemr" repo and everything going on in the ".github" folder in this repo.

If we have CI/CD that verifies all the functionality the container should have and uses the benchmarking utilities then the process to find optimizations like this becomes much easier. Anyone would be able to fork the repository, make a change, and then know in minutes whether or not that change improved the container and maintained the functionality we need to properly run OpenEMR.

@kojiromike
Contributor

The time from container start to when it passes healthchecks is something really important…

Agreed

I think the fastest way to startup would basically be to ...

1. Elect a leader.
2. Have the leader launch multiple parallel processes (as many as we can manage without sacrificing reliability) to complete setup and collect the PIDs of each so that when they complete the leader can make sure they all did so successfully.
3. As much as possible while waiting for the leader to finish have follower containers also perform their own setup tasks (and parallelize them to the greatest extent possible).

Let me see if I understand. Is openemr.sh slow when the application is already set up and ready to go, and we're just scaling it horizontally by adding another node in basically OPERATOR mode? Or is the problem that openemr.sh is slow starting up when it has a bunch of setup tasks to do? Or both?

@Jmevorach
Contributor Author

@kojiromike

Both.

Most of my contributions to the OpenEMR containers come from a perspective similar to an operator of a cluster (be it Kubernetes or otherwise, in the case of ECS) who runs OpenEMR on behalf of a larger organization; sort of like a DevOps engineer hired by a hospital and tasked with operating a cluster running OpenEMR. The difference is that rather than being concerned about a specific installation, I'm basically trying to make machines that print out highly reliable setups for other operators. Most of the time I try to do that by acting like I'm an operator myself and stress testing the setups I'm printing out to see what I need to improve in the machine that makes them.

When we first started making OpenEMR on ECS we noticed that the deployment was failing because the healthchecks were timing out. At first we thought this was because the containers had failed some setup task and were just sitting idle until they failed healthchecks and were terminated, but then we realized the containers were simply taking a really long time to start up.

When we run on ECS we use Fargate and run on a collection of Graviton 2 and Graviton 3 processors in whatever AWS region you decide to use. When we run on EKS we tend to use (although auto-mode defines node-pools with multiple types of instances) Graviton 3 or 4. I ended up deciding on Graviton because it was 1/ cheaper than other comparable instance types 2/ There's been a bunch of optimization work done for running PHP on Graviton and I found it generally performed better than comparable x86 based instances. Also AWS has a lot of ARM chips so from the perspective of autoscaling I think the chance that one of our clusters is going to ask an AWS data center for another ARM vCPU and for that data center to say "sorry we're all out" is pretty low.

This allows us to operate OpenEMR at an incredibly large scale and to do so relatively affordably. The initial load testing results we got from the ECS version alone (which is arguably less performant overall than the EKS setup) show us comfortably serving the equivalent of ~20K virtual users or ~4,000 req/sec with the equivalent of $0.079/hour worth of Fargate compute that never went beyond 19% avg. CPU utilization or 31% avg. RAM utilization. I truly believe that both the ECS and EKS setups as they exist today could be configured to meet the demands of even national or global scale deployments and serve many, many concurrent users.

I'd say it takes about ~9-12 minutes for a leader pod to complete configuration on ECS and pass healthchecks and about ~6-8 minutes for new follower pods to spin up. There's some variation here because it's luck of the draw whether or not you end up running on Graviton 2 vs 3 and you can notice the difference in testing.

I'd say it takes about ~7-9 minutes for a leader pod to complete configuration on EKS and pass healthchecks and about ~4-6 minutes for a follower pod to spin up. There's some variation here because it's luck of the draw as to what specific processor you end up running on but in general I think the processors we end up running on in our EKS architecture are a little bit more performant than those we get allocated on average when we run on ECS.

The ECS version is cheaper and easier to manage than the EKS version while the EKS version offers integration paths to a lot more tools and a lot more functionality around really granular observability out of the box so I think it makes sense to offer both. In general I'd advise using the ECS based one for smaller to mid-size installations and the EKS based one for mid-size to large installations but you can definitely use either the ECS or EKS one at any scale.

While it's improved over time as we've figured out ways to ...

  1. Run OpenEMR better on EKS and ECS

  2. Improve the startup and overall performance of the docker containers (i.e. introducing Opcache, etc. etc.)

we've still had to set pretty generous healthchecks to get OpenEMR to run in a container orchestrator on AWS. I imagine that as other architectures are created in the future to run OpenEMR on container orchestrators elsewhere we'll probably run into similar issues.

It's super normal for applications to take a few minutes to start, either as replicas or, especially, when they boot for the first time. However, applications that take a while to boot and are going to be scheduled on Kubernetes or a similar orchestrator generally try to start responding to healthchecks as soon as is reasonably possible, on some endpoint that says "this application might not be ready to serve traffic, but it is operating normally". An example would be an application that starts responding to healthchecks at /health almost immediately, but if you go to the website while it's setting up you see some sort of landing page that says "Application is Starting" or something to that effect. In our case right now, though, openemr.sh starts Apache last, after all the other setup has completed.

If we can get the containers to reliably respond to healthchecks in <2 minutes when they boot, then from the perspective of a user or an operator we'd be able to map our autoscaling capacity very closely to the amount of compute we need at any one time to serve traffic when running in a cloud, and basically all of the nodes would be healthy at any one time, because the longest an unhealthy node would be allowed to serve traffic would be <2 minutes. This would make the system cheaper and more reliable overall.

You can get around the long startup times today by setting the autoscaling values lower than you'd probably want to. Instead of scheduling a scaling event at 70-90% average CPU utilization and worrying about running up against more traffic than you can serve while new instances spin up, you can set it at 40-60% or something similar, so that you're more conservative about how much compute you have ready to serve traffic at any one time.

Aside from requiring you to spend more money, you can get away with using this method to reliably serve even large volumes of OpenEMR traffic when running in a public cloud.

What's perhaps more concerning than cost is the case where a node fails and we don't have many nodes up at the time, because we can serve a lot of traffic with just a few nodes. Because we've had to configure the healthchecks to be really lenient to accommodate a ~5-7 minute window where a container is running but not responding to healthchecks, this can hypothetically lead to a scenario where a user experiences degraded functionality over a period of a few minutes and doesn't really know why.

Let's imagine a hypothetical situation where 1 of 4 hosts is doing nothing but serving 5xx responses to users. Neither of our setups needs to use sticky sessions, and most of the time we're just evenly distributing traffic to our replicas behind a load balancer. That means about 1 in 4 requests sent by a user doesn't work, so from their perspective they're clicking around in OpenEMR, 1 in 4 clicks doesn't work correctly in the UI, and they're left scratching their head as to what's going on; it may take a few minutes for that to resolve. It's not the biggest deal in the world, but if we were to lower the "time to healthy" by even a few minutes, it would create a scenario in which basically all of the nodes, minus a minute or two here and there, were healthy at all times.

The faster a container starts responding to healthchecks, the faster we can have the orchestrator remove unhealthy nodes and replace them with healthy ones, which greatly improves the reliability of the system overall. We'd also save money on autoscaling, because we could set higher CPU and RAM thresholds for triggering scale-out events.

In most orchestrators (including ECS and EKS) you can set a startup grace period that says something to the effect of "this container might take 5 minutes to boot initially, but after that first 5-minute grace period do more aggressive healthchecks", which is something we do; so it definitely doesn't take us ~9-11 minutes to identify and remove unhealthy nodes, but the window is still pretty long relative to most applications you'd run in a cluster, so it's something I'm still thinking about.
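
For what it's worth, Docker itself has an equivalent knob; the values below are illustrative, and it assumes curl exists in the image:

```dockerfile
# Sketch: be lenient during startup, aggressive afterwards
HEALTHCHECK --start-period=5m --interval=30s --timeout=5s --retries=3 \
  CMD curl -fsS http://localhost/ || exit 1
```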

This PR aims to address three things I've noticed when running OpenEMR as a horizontal replica:

  1. The time to configure the MySQL database the first time can take a while.

  2. The file operations like chmod/chown performed by each container on startup can take a while, and unless you're doing development work I don't think it's necessary to keep them in the production container, which is basically just meant to reliably serve traffic.

  3. When starting OpenEMR in a swarm, if the leader failed for whatever reason, all of the remaining containers would sit idle waiting for the leader (which no longer exists because it crashed and got swapped out) to create a "docker-completed" file, and eventually the whole deployment would fail. You'd end up with three or four containers doing nothing but printing "Waiting for docker leader to finish configuration" to the logs over and over. This PR's new "heartbeat" system makes it so that a set of containers can recover from this by a) detecting that the leader has failed and b) electing a new leader to pick up where configuration left off, as sketched below.
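
To make item 3 concrete, here is a very rough sketch of a file-based heartbeat on a shared volume; the path, interval, and staleness threshold are assumptions, not the exact logic in this PR:

```bash
# Sketch: leader refreshes a heartbeat file; followers treat a stale file as a dead leader.
HEARTBEAT=/var/www/localhost/htdocs/openemr/sites/default/docker-leader-heartbeat
STALE_AFTER=60   # seconds without a refresh before followers assume the leader died

leader_heartbeat_loop() {      # run in the background by the current leader
  while true; do
    date +%s > "${HEARTBEAT}"
    sleep 10
  done
}

leader_is_alive() {            # checked by followers while they wait for docker-completed
  [[ -f "${HEARTBEAT}" ]] || return 1
  local last now
  last=$(<"${HEARTBEAT}")
  now=$(date +%s)
  (( now - last < STALE_AFTER ))
}
```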

I think we're actually pretty close to the point where any more startup optimizations would be nice to have but not as critical as they are today. That's all while acknowledging that the current setups can perform pretty well today and host OpenEMR with autoscaling for even incredibly large deployments.

What I'd like to introduce in a subsequent PR (or I'd be happy to support anyone who'd like to lead this) would be to modify the containers so that they basically start passing some basic healthcheck as quickly as possible. We can still layer more aggressive and thorough sets of healthchecks on top of that (and even give them grace periods if need be) but having an indication as early as possible that the container is progressing normally even if it's not finished with everything it needs to do would be super helpful.

I think this PR plus maybe one or two more is all we'd need to do to ensure that startup times were fast enough in the production container (Flex containers wouldn't be in scope for this effort) to pass healthchecks quickly and then everything else beyond that would be more of a "nice to have" rather than something that would substantially impact the overall functioning of the system.
