Skip to content

Commit 6398858

Browse files
authored
Update README.md
1 parent 370da1d commit 6398858

File tree

1 file changed

+2
-1
lines changed

1 file changed

+2
-1
lines changed

README.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -54,7 +54,8 @@ The following table shows the `approximate CPU to memory mapping for AWS Lambda`
5454
It is important to note that the actual amount of CPU that a function receives may vary depending on the workload and other factors. For example, a function that is CPU-intensive will likely receive more CPU than a function that is memory-intensive.
5555
When running a `PySpark script` for a larger file, you can specify a higher memory allocation so that the data can be partitioned and distributed to `multiple vCPUs` for faster processing. This is because `PySpark uses a distributed processing model`, where the data is `divided into smaller chunks` and processed by multiple machines. The more memory that is available, the larger the chunks of data that can be processed at once, which can lead to faster performance.
5656

57-
57+
#### Initialization Duration on AWS Lambda
58+
<p>Our performance testing of SoAL revealed an average initialization duration of 550 ms, which is significantly faster compared to the Spark Cluster mode.</p>
5859

5960
#### DockerFile
6061
<p>The DockerFile builds the image using an AWS based image for Python 3.8. During the build process, it installs PySpark, copies all the required files, and sets the credentials locally on the container. </p>

0 commit comments

Comments
 (0)