Commit 7dc2546

Update README.md
1 parent bfa9357 commit 7dc2546

1 file changed: +4 -0 lines changed


README.md

Lines changed: 4 additions & 0 deletions
@@ -1,6 +1,10 @@
# An Overview of Python's Datatable package
Python library for efficient multi-threaded data processing, with support for out-of-memory datasets.

+If you are an R user, chances are that you have already been using the data.table package. data.table is an extension of R's data.frame. It is also the go-to package for R users when it comes to fast aggregation of large data (e.g. 100GB in RAM).
+R's data.table is a versatile, high-performance package, valued for its ease of use, convenience, and programming speed. It is well known in the R community, with over 400k downloads per month and almost 650 CRAN and Bioconductor packages using it (source).
+So, what is in it for Python users? The good news is that data.table has a Python counterpart called datatable, which has a clear focus on big data support, high performance, both in-memory and out-of-memory datasets, and multi-threaded algorithms. In a way, it can be called data.table's younger sibling.
+

## Installation

0 commit comments
