Running Apache Zeppelin on Docker

Apache Zeppelin is a fantastic open source web-based notebook. Zeppelin allows users to build and share great looking data visualizations using languages such as Scala, Python, SQL, etc. Running Apache Zeppelin on Docker is a great way to get Zeppelin up and running quickly. Here are the basic steps: Pick an OS Zeppelin runs great

read more

SELECT N Percent of Records in MySQL

The ability to select a certain percentage of records from a query or database table can be important when working with lots of data. Luckily this is easy to do in MySQL with the RAND() function and a WHERE clause. RAND() returns a random floating point number between 0 and 1, making it easy to

read more

How to Select a Random Sample of Records in MySQL

The ability to select a random sample of records from query or database table can be important when working with lots of data. Luckily this is easy to do in MySQL with the RAND() function and a WHERE clause. RAND() returns a random floating point value between 0 and 1, making it very easy to

read more

How to Left Pad Numbers in AWK

Occasionally when working in a Linux/Unix environment comes the need to left pad numbers with zeros (add leading zeros to numbers). This is often done to make a series of numbers the same length. An example of doing this with awk can be seen below:

How to Stop all Running Docker Containers

People that work with Docker know it is easy to create a large number of containers. Occasionally you need to stop all running containers on your machine. This can be accomplished by using docker stop together with docker ps (using command substitution).

docker ps is used to list all running containers. The -q option

read more

How to Merge multiple commits in Git

The ability to merge multiple commits in Git is a huge plus. Merging commits requires “rebasing” which will essentially rewrite the project history appending commits onto the last commit (by default) in a different branch, or even to an earlier commit in the same branch. Rebasing can have some damaging effects, so be careful when

read more

Counting Relationships in Neo4j

When working with different graphs in Neo4j, it is often important to count the number of relationships between different nodes. Below are examples of common relationship counting tasks. The data for these examples can be found here and the script to load this data into Neo4j can be found here. Counting Total Relationships (Edges) MATCH

read more

Delete all Nodes and Relationships in a Neo4j Database

Deleting Nodes and Relationships Deleting all nodes and relationships in a Neo4j database is very simple. Here is an example that does just that: MATCH (n) DETACH DELETE n; The DETACH keyword specifies to remove or “detach” all relationships from a particular node before deletion. If relationships exist on a node at the time deletion

read more

Load a Tab Delimited File into Neo4j

Neo4j is a popular graph database which provides an easy way to import text files using the Cypher query language and a LOAD CSV clause. LOAD CSV by default expects files to be delimited by commas. In order to specify another field delimiter, add FIELDTERMINATOR to the LOAD CSV clause. For example:

The statement

read more