cognome/bibliometrics-notebooks

Bibliometrics Notebooks

The following notebooks were originally written for Spark version 1.5.1.

Pipeline Notebooks

| Pos. | Title | Components | Code | View | Author |
|------|-------|------------|------|------|--------|
| 01 | Geonames | Spark, SparkSQL, spark-csv | json | view | Pierre-Alexandre FONTA, Jérémy SUBTIL |
| 11 | Crawl data import | Spark, SparkSQL | json | view | Jérémy SUBTIL |
| 12 | Web of Science scraping | Spark, SparkSQL, GraphX, jsoup | json | view | Pierre-Alexandre FONTA, Jérémy SUBTIL |
| 13 | Data processing | Spark, SparkSQL | json | view | Pierre-Alexandre FONTA, Jérémy SUBTIL |
| 21 | Journal disambiguation | Spark, SparkSQL | json | view | Pierre-Alexandre FONTA |
| 22 | Publication disambiguation | Spark, SparkSQL | json | view | Pierre-Alexandre FONTA |
| 23 | Author disambiguation | Spark, SparkSQL, GraphX | json | view | Jérémy SUBTIL |
| 23 | Author disambiguation (PoC) | Spark, SparkSQL, GraphX | json | view | Pierre-Alexandre FONTA |
| 24 | GeoNames city identification | Spark, SparkSQL | json | view | Pierre-Alexandre FONTA |
| 25 | Institution names identification | Spark, SparkSQL | json | view | Pierre-Alexandre FONTA |
| 31 | Publication indicators | Spark, SparkSQL | json | view | Pierre-Alexandre FONTA |
| 32 | Author indicators | Spark, SparkSQL, spark-csv, tinkergraph-gremlin | json | view | Pierre-Alexandre FONTA, Jérémy SUBTIL |
| 33 | Institution indicators | Spark, SparkSQL, spark-csv, tinkergraph-gremlin | json | view | Pierre-Alexandre FONTA, Jérémy SUBTIL |
| 34 | Exports | Spark, SparkSQL, spark-csv | json | view | Pierre-Alexandre FONTA, Jérémy SUBTIL |