sansa-notebooks

History

Name	Name	Last commit message	Last commit date
parent directory ..
docs/images	docs/images	Moved all content to a subfolder for imminent merge	Oct 6, 2020
examples	examples	Moved all content to a subfolder for imminent merge	Oct 6, 2020
notebook	notebook	Moved all content to a subfolder for imminent merge	Oct 6, 2020
.gitignore	.gitignore	Moved all content to a subfolder for imminent merge	Oct 6, 2020
LICENSE	LICENSE	Moved all content to a subfolder for imminent merge	Oct 6, 2020
Makefile	Makefile	Moved all content to a subfolder for imminent merge	Oct 6, 2020
README.md	README.md	Moved all content to a subfolder for imminent merge	Oct 6, 2020
docker-compose-app.yml	docker-compose-app.yml	Moved all content to a subfolder for imminent merge	Oct 6, 2020
docker-compose.yml	docker-compose.yml	Moved all content to a subfolder for imminent merge	Oct 6, 2020

README.md

SANSA-Notebooks

Interactive Spark Notebooks for running SANSA-Examples. In this repository you will find a docker-compose.yml for running Hadoop/Spark cluster locally. The cluster also includes Hue for navigation and copying file to HDFS. The notebooks are created and run using Apache Zeppelin.

Requirements

Docker Engine >= 1.13.0
docker-compose >= 1.10.0
Around 10 GB of disk space for Docker images

After installation of docker add yourself to docker group (%username% is your username) and relogin:

sudo usermod -aG docker %username%

This allows to run docker commands without sudo prefix (necessary for running make targets).

Getting started

Get the SANSA Examples jar file (requires wget):

make

Start the cluster (this will lead to downloading BDE docker images, will take a while):

make up

When start-up is done you will be able to access the following interfaces:

http://localhost:8080/ (Spark Master)
http://localhost:8088/home (Hue HDFS Filebrowser)
http://localhost/ (Zeppelin)

To load the data to your cluster simply do:

make load-data

Go on and open Zeppelin, choose any available notebook and try to execute it.

To restart Zeppelin without restarting the whole stack:

make restart

Stop the whole stack:

make down

Executing Examples From Command Line

It is also possible to execute the applications from the command line. Get SANSA-Examples jar and start the cluster if you already have not done it:

make
make up
make load-data

Then you can execute any of the following commands to run the examples from the command line:

make cli-triples-reader
make cli-triple-ops
make cli-triples-writer
make cli-pagerank
make cli-rdf-stats
make cli-inferencing
make cli-sparklify
make cli-owl-reader-manchester
make cli-owl-reader-functional
make cli-owl-dataset-reader-manchester
make cli-owl-dataset-reader-functional
make cli-clustering
make cli-rule-mining

How to Contribute

We always welcome new contributors to the project! Please see our contribution guide for more details on how to get started contributing to SANSA.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Files

sansa-notebooks

sansa-notebooks

README.md

SANSA-Notebooks

Requirements

Getting started

Executing Examples From Command Line

How to Contribute

Files

sansa-notebooks

Directory actions

More options

Directory actions

More options

Latest commit

History

sansa-notebooks

Folders and files

parent directory

README.md

SANSA-Notebooks

Requirements

Getting started

Executing Examples From Command Line

How to Contribute