Data Hub is a strong data management & orchestration tool for data integration, data processing, and data governance. Data orchestration is composed of reusable pipelines, configurable operations to process data pulled from a variety of sources, including CSV files, XML, web services APIs, hybrid cloud services, and SAP data stores like HANA, BW ABAP Data flow, etc. This blog gives a detailed introduction to SAP Data Hub and Installation of Dev Edition.

Advanced operations can be achieved using analytics or machine learning libraries such as TensorFlow, or custom-coded tasks in datahub.

SAP Data Hub is available in two different editions:

  1. SAP Data Hub – Developer Edition
  2. SAP Data Hub – Trial edition

 

SAP Data Hub, Developer Edition

SAP Data Hub developer edition was first delivered at the end of 2017. Its latest version 2.4 is now available for download.

SAP Data Hub can be installed on any platform that supports Kubernetes.

This includes managed cloud services:

  1. AWS (EKS), GCP (GKE), Azure (AKS)
  2. Private cloud
  3. On-premise installations like SUSE CaaS Platform

SAP Data Hub developer edition can be installed on your local computer with the help of Docker container. SAP Data Hub packaged them together with Hadoop Distributed File System (HDFS), Spark and Livy into a single Docker container image. This container image can be used to initiate options such as SAP Vora Database, SAP Vora Tools, SAP Data Hub Modeler or HDFS, Spark.

 

Limitations of installing SAP Datahub Developer edition on your local computer are:

  1. Data governance and workflow features not being available
  2. Currently, we are facing an issue with using operators in SAP Data Hub developer edition
  3. Operators related to machine learning like TensorFlow and image processing operators OpenCV currently cannot be used in SAP Data Hub developer edition
sap-data-hub-introduction-installation-dev-edition

SAP Datahub Architecture

 

Pre-requisites and hardware requirements

Before getting started with SAP Data Hub Developer Edition installation, please ensure, that the following prerequisites and hardware requirements are met in your local computer.

 

Hardware requirements

  1. 64-Bit Processor with Intel/AMD instruction set “X86_64”
  2. At least 2 CPU Cores (better: 4 Cores) for the purpose of the Developer Edition.
  3. At least 8 GB of RAM for the purpose of the Developer Edition
  4. At least 10 GB disk space for running docker image
  5. Internet Connectivity (temporary)

 

Software requirements

  1. The operating system must support the installation of the Docker (https://www.docker.com)
  2. Docker is available for Windows, MacOS, and Linux

 

Docker installation

Docker is a computer program that performs operating-system-level virtualization. Docker is used to run software packages called containers. Docker provides seamless integration with the Windows operating system.

Please download the docker for Windows with the below link:

https://hub.docker.com/editions/community/docker-ce-desktop-windows

Docker Desktop for Windows is a Docker designed to run both Windows and Linux Docker containers. However, Datahub Developer edition requires the Docker to be switched to “Linux Containers” mode.

Test whether the docker is working properly by running the below command in Linux:

Docker run Test

 

Obtaining SAP Data Hub Developer Edition

Download the Developer Edition with below link and unpack the archive into your local disk:

https://www.sap.com/developer/trials-downloads/additional-downloads/sap-data-hub-developer-edition-15004.html

 

Building Container Image

Steps to build a container image:

  1. Open a terminal window and switch to the directory where you have unpacked the Developer Edition
  2. Issue the command for creation of the base image docker build –tag sapdatahub/dev-edition-base:15.0-01 -f dev-edition-base.Dockerfile
  3. Issue the command for creation of the final image docker builds –tag sapdatahub/dev-edition:2.3

 

Running SAP Data hub Developer Edition

Run the below command to get more information about the usage of the developer edition:

docker run -ti sapdatahub/dev-edition:2.3

 

Supported commands:

CommandPurpose
runstart the SAP Datahub processes in the container
run-hdfsstarts processes related to HDFS and Spark/Livy in container
promptstarts into the bash shell and start further processes manually
networkperforms a network check for accessing public internet sites
  1. The minimal set of parameters to spin up the Developer Edition as container is: docker run sapdatahub/dev-edition:2.3 run –agree-to-sap-license
  2. Run a Docker container
    docker network create dev-net
  3. Followed by (for Linux, Mac)

sap-data-hub-introduction-installation-dev-edition

 

or for Windows

sap-data-hub-introduction-installation-dev-edition

 

Launch SAP Data Hub modeler by running this URL: https://localhost:8090 

 

Start and Stop Data Hub

To start and stop the data hub, use the below commands:

Hostname “Devedition” is not a mandatory name, you can give name based on your needs,

               Docker start Devedition

               Docker Stop Devedition

 

Running HDFS

User can launch HDFS by running the below commands for accessing the Apache Hadoop user interface as shown in the below image:

sap-data-hub-introduction-installation-dev-edition

 

or for windows

sap-data-hub-introduction-installation-dev-edition

 

Launch HDFS by running the below URL:

https://localhost:50070

 

Quick cockpit view of SAP Data Hub Developer Edition

Data Hub consists of below user interface for navigating, creating pipeline and workflow,

sap-data-hub-introduction-installation-dev-edition

 

References:

Installation of SAP Data Hub Dev Edition:

https://developers.sap.com/tutorials/datahub-docker-v2-setup.html

 

Limitation of SAP Data Hub Dev Edition:

https://blogs.sap.com/2017/12/06/sap-data-hub-developer-edition/

https://blogs.sap.com/2017/12/06/faqs-for-sap-data-hub-developer-edition/

 

Read more blogs related to SAP here.

Subscribe to our Newsletter

5920 Windhaven Pkwy
Plano TX 75093.

+1 888-227-2794

+1 972-232-2233

+1 888-227-7192

solutions@visualbi.com