Data Analysis using Zeppelin
The Complete Guide to access MDS with HeatWave by Zeppelin
In this guide we will walk through how to install and access
MDS with Heatwave.
What is MDS and HeatWave ?
What is Zeppelin ?
How to Install Zeppelin ?
How to access HeatWave
MDS Data Analysis using Zeppelin
Now, MySQL (only in PaaS model of OCI) comes with HeatWave ,which is
It is enabled when you add a heatwave cluster to a MySQL DB System.
So MySQL database service(MDS) will give faster results and then needed some data analysis tool to make
In this blog let me introduce Apache
Apache Zeppelin , an open
We will explore each item in details…
Overall, My Idea is to
Zepplein can be installed anywhere (any public/private cloud
What is MDS and Heatwave ?
Oracle MySQL Database
MySQL Database Service is the only MySQL cloud service with an
MySQL Database Service is 100% built, managed, and supported by the
more info:- https://www.oracle.com/mysql/
MDS Business benefits :-
What is Zeppelin ?
Zeppelin is web-based
Features of Apache Zeppelin
v Data Ingestion.
v Data Discovery
v Data Analytics
Make sure the port of zeppelin 8080
and in case you are using Oracle Cloud Infrastructure (OCI) then make
To access the Zeppelin, always access with proper user with Interpreter access otherwise any user
Sometime anonymous user get into the portal of Zeppelin by entering
Hence, It is important to have data source authorization in Apache
Note:- I couldn’t dig much in details about security and my major focused
on where you are , how are you accessing the Zeppelin , how eill have seamless
experiences with MDS and HeatWave and do awesome visualization and data
Installation of Zeppelin
In this blog, installation of Zeppelin will be on Oracle
Cloud Infrastructure (https://www.oracle.com/in/cloud/)
sudo yum install
Step 2:- Download the Zeppelin by
Step 3 :- create user and give
sudo adduser -d /home/opc/zeppelin -s /sbin/nologin zeppelin
sudo chown -R zeppelin:zeppelin /home/opc/zeppelin
Step 4:- rename zeppelin site
sudo cp zeppelin-site.xml.template zeppelin-site.xml
sudo cp zeppelin-site.xml.template
Step 5 :- #Start Zeppelin
http://<IP Address of Compute Instance/Local IP>::8080/#/
Connect MDS and Heatwave through Zeppelin
Download MySQL Connector/J
warning: mysql-connector-java-8.0.28-1.el7.noarch.rpm: Header V4 RSA/SHA256
Create a MySQL Interpreter
#Create directory called mds
# mkdir mds
#move the “mysql-connector-java.jar” file into MDS folder
Navigate to the Interpreter
Search for MDS interpreter and
Finally once changes are final
Access data from MDS and Heatwave
Please ensure MDS and
Heatwave is up and running.
Create a notebook and access MDS with Heatwave via Zeppelin
Awesome! , Zeppelin Connected to MDS
Let’s do data analysis by using zeppelin
##Run Below SQL statement to generate report
SELECT airline.airlinename, SUM(booking.price) as
FROM booking, flight, airline, airport_geo
WHERE booking.flight_id=flight.flight_id AND
info about Heatwave:- https://www.oracle.com/mysql/heatwave/
Apache Zeppelin is a kind of tool, which makes Data
Scientist life smooth, they can do everything they need in one place. Things
like data ingestion, data exploration, data visualization, and data analytics
can be done in the zeppelin notebook and MDS HeatWave is a massively parallel,
high performance, in-memory query accelerator for Oracle MySQL Database Service
that accelerates MySQL performance by orders of magnitude for analytics and
mixed workloads and cost lesser than Specialist analytics product like Amazon
Redshift, Aurora ,Snowflake ,Azure Synpase ,google Big Query etc.