Datahive is a configuration-driven, end-to-end data pipeline solution that simplifies the management of data workflows. Built on Kafka, Hadoop, Apache Spark, Elasticsearch, and Kibana, and paired with an intuitive UI, Datahive lets users manage and monitor their data stacks with ease.
- 🚀 Streamlined data pipeline setup
- ☕ Automated data processing while you enjoy your coffee
- 📊 Utilizes Kafka, Hadoop, Apache Spark, Elasticsearch, and Kibana
- 🛠️ Easy configuration through YAML files
- 🔄 Supports both stream and batch processing
Define your data pipeline effortlessly using a simple YAML configuration file. Specify input and output schemas for each service, and let Datahive handle the rest. Below is a sample configuration for stream processing:
```yaml
type: stream
kafka:
  - inTopic: <your-topic-name>
    outTopic: <your-topic-name>
    hdfs: false
    transform: |
      def transform(record) {
        def jsonObject = record
        // do your transformation logic in a Groovy script
        return jsonObject
      }
  - inTopic: <your-topic-name>
    hdfsFileName: <your-hdfs-filename>
    hdfs: true
spark:
  - app-resource: <path-for-your-spark-build-file>
    driver-memory: 1g
    executor-memory: 2g
  - app-resource: <path-for-your-second-spark-build-file>
    driver-memory: 1g
    executor-memory: 2g
    res-location: <path-for-the-spark-job-code>
    main-class: <main-class-of-your-spark-job>
    job-name: <name-of-your-job>
elasticsearch:
  -
kibana:
  dashboard-config:
```
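Batch processing uses the same structure. The fragment below is a minimal sketch, assuming that switching the `type` field to `batch` is the main change; every bracketed value is a placeholder to be filled in for your environment:

```yaml
type: batch
kafka:
  - inTopic: <your-topic-name>
    hdfsFileName: <your-hdfs-filename>
    hdfs: true
spark:
  - app-resource: <path-for-your-spark-build-file>
    driver-memory: 1g
    executor-memory: 2g
    main-class: <main-class-of-your-spark-job>
    job-name: <name-of-your-job>
```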
- Clone the repository.
- Install the required dependencies.
- Configure Datahive using the provided YAML files.
- Run the application.
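The steps above can be sketched as shell commands. This is a hypothetical walkthrough; the repository URL, build tool, and entry point are placeholders, so adjust them to your setup:

```shell
# Placeholder walkthrough; substitute your actual repository URL
git clone <repository-url>
cd datahive

# Install the required dependencies with your project's build tool
# (e.g. Maven or Gradle for the JVM components)

# Edit the provided YAML files to describe your pipeline,
# then start the application per the project's run instructions.
```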
Screenshots: Home Page, Features, Highlights, Login Page, Dashboard, WorkerStats, Datahive Stack Stats, Alerts.