DataScience is a Python library that provides classes for managing and analyzing data.
- Written by Professor John DeNero, Professor David Culler, Sam Lau, and Alvin Wan
Table of Contents
- [Activating Environment]
- [Deactivate Environment]
Clone the following repository
git clone https://github.com/data-8/datascience.git
Use the package manager pip to install the DataScience library.
pip install datascience
This project requires Python 3.6+ and the following key libraries:
matplotlib
numpy
pandas
scipy
pytest (for testing)
For the full list of dependencies, see requirements.txt and environment.yml.
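To confirm that an interpreter meets these requirements before installing, a small check along the following lines may help. This is only a sketch: the function name check_environment and the shape of its report are illustrative choices, not part of the DataScience library, and it only tests whether each package can be located, not which version is installed.

```python
import importlib.util
import sys

# Libraries named in the requirements list; pytest is only needed for testing.
REQUIRED = ("matplotlib", "numpy", "pandas", "scipy")
OPTIONAL = ("pytest",)


def check_environment(required=REQUIRED, optional=OPTIONAL, min_python=(3, 6)):
    """Report whether the current interpreter satisfies the stated requirements.

    Hypothetical helper for illustration; it is not shipped with datascience.
    """
    def installed(name):
        # find_spec returns None when a top-level module cannot be located.
        return importlib.util.find_spec(name) is not None

    return {
        "python_ok": sys.version_info >= min_python,
        "missing_required": [n for n in required if not installed(n)],
        "missing_optional": [n for n in optional if not installed(n)],
    }


if __name__ == "__main__":
    print(check_environment())
```

An empty missing_required list together with python_ok set to True suggests the environment is ready; anything listed as missing can be installed with pip or conda.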
Install the dependencies into a new conda environment named datascience.
Activating Environment
source activate datascience
Deactivate Environment
source deactivate
from datascience import (here)
Replace (here) with the class that you want to use, such as Table.
from datascience import *
Using * imports all of the classes at once.
Example
Table().with_columns(
    'cars', make_array(1, 2, 3),
    'colors', make_array('red', 'blue', 'black')
)