Skip to content

A Python library for introductory data science

License

Notifications You must be signed in to change notification settings

Kamilo408/datascience

 
 

Repository files navigation

Documentation Status Build Status Coverage Status

DataScience

DataScience is a Python library designed to explore different classes for managing and analyzing data.

  • Written by Professor John DeNero, Professor David Culler, Sam Lau, and Alvin Wan

Table of Contents

  1. Installation

  2. Dependencies

  3. Developing

    • [Activating Environment]
    • [Deactivate Environment]
  4. Usage

Installation

GitHub

Clone the following repository

git clone https://github.com/data-8/datascience.git

Using pip

Use the package manager pip to install the DataScience library.

pip install datascience

Dependencies

This project requires Python 3.6+ and the following key libraries:

  • matplotlib
  • numpy
  • pandas
  • scipy
  • pytest (for testing)

Full Dependency List

For the full list of dependencies, see the requirements.txt and environment.yml

Developing

Install the dependencies into a new conda environment named datascience.

Activating Environment

source activate datascience

Deactivate Environment

source deactivate

Usage

from datascience import (here)

Select the class that you want to use.

from datascience import *

You can select all the classes with " * "

Example

Table().with_columns(
    'cars', make_array(1, 2, 3),
    'colors', make_array('red', 'blue', 'black')
)

About

A Python library for introductory data science

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 95.7%
  • Python 4.3%