Creating Graphs with the Apache Zeppelin Shell Interpreter

Using Apache Zeppelin’s shell (sh) interpreter is a great way to run and share shell scripts. This way you can use all your favorite command line tools like sed and awk to create great looking visualizations using Apache Zeppelin. By default your shell output will not be displayed in a tabular format or leverage Zeppelin’s

read more

Connecting Apache Zeppelin to MySQL

zeppelin-mysql-graph

Apache Zeppelin is a fantastic open source web-based notebook. Zeppelin allows users to build and share great looking data visualizations using languages such as Scala, Python, SQL, etc. A common back end for Zeppelin in MySQL. Here are the steps needed to connect Zeppelin a remote MySQL database: Download the MySQL Connector The first thing

read more

Running Apache Zeppelin on Docker

Apache Zeppelin is a fantastic open source web-based notebook. Zeppelin allows users to build and share great looking data visualizations using languages such as Scala, Python, SQL, etc. Running Apache Zeppelin on Docker is a great way to get Zeppelin up and running quickly. Here are the basic steps: Pick an OS Zeppelin runs great

read more

SELECT N Percent of Records in MySQL

The ability to select a certain percentage of records from a query or database table can be important when working with lots of data. Luckily this is easy to do in MySQL with the RAND() function and a WHERE clause. RAND() returns a random floating point number between 0 and 1, making it easy to

read more

How to Select a Random Sample of Records in MySQL

The ability to select a random sample of records from query or database table can be important when working with lots of data. Luckily this is easy to do in MySQL with the RAND() function and a WHERE clause. RAND() returns a random floating point value between 0 and 1, making it very easy to

read more

How to Left Pad Numbers in AWK

Occasionally when working in a Linux/Unix environment comes the need to left pad numbers with zeros (add leading zeros to numbers). This is often done to make a series of numbers the same length. An example of doing this with awk can be seen below:

How to Stop all Running Docker Containers

People that work with Docker know it is easy to create a large number of containers. Occasionally you need to stop all running containers on your machine. This can be accomplished by using docker stop together with docker ps (using command substitution).

docker ps is used to list all running containers. The -q option

read more

How to Merge multiple commits in Git

The ability to merge multiple commits in Git is a huge plus. Merging commits requires “rebasing” which will essentially rewrite the project history appending commits onto the last commit (by default) in a different branch, or even to an earlier commit in the same branch. Rebasing can have some damaging effects, so be careful when

read more

Counting Relationships in Neo4j

When working with different graphs in Neo4j, it is often important to count the number of relationships between different nodes. Below are examples of common relationship counting tasks. The data for these examples can be found here and the script to load this data into Neo4j can be found here. Counting Total Relationships (Edges) MATCH

read more