An Apache Zeppelin Test with Google Sheets

A five-minute proof of concept (POC) using Google Sheets in Zeppelin.

The goal is to test how easy it is to drop data into a spreadsheet and then analyze it using Zeppelin.

First, go to https://docs.google.com and create a new spreadsheet to collect your data. Alternatively, you can clone my copy of bank.csv, which is used in this example.

Next, create a shareable link for this spreadsheet by clicking [Share] in the upper right-hand corner of your Google Sheet.

 

Then copy the shareable link.

 

If you don’t have access to a Zeppelin service, you can start one using Docker, which makes Zeppelin accessible at http://localhost:8080.

 

The shareable link ( https://docs.google.com/spreadsheets/d/16VVaWKbCMzzZMZum5BBLQ6jEHn-DAzIvd02NOO5ApP4/edit?usp=sharing ) follows the format https://docs.google.com/spreadsheets/d/[sheet-id]/edit?usp=sharing, so the sheet ID in this example is 16VVaWKbCMzzZMZum5BBLQ6jEHn-DAzIvd02NOO5ApP4.
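Pulling the ID out of the link by hand is easy to get wrong, so here is a minimal sketch that does it with a regular expression. The helper name extract_sheet_id and the pattern are my own assumptions for illustration, not part of Zeppelin or any Google API.

```python
import re

# Matches the "/spreadsheets/d/<sheet-id>" segment of a Google Sheets link.
# Assumption: sheet IDs consist of letters, digits, underscores, and hyphens.
SHEET_ID_PATTERN = re.compile(r"/spreadsheets/d/([A-Za-z0-9_-]+)")


def extract_sheet_id(share_url: str) -> str:
    """Return the sheet ID embedded in a Google Sheets shareable link."""
    match = SHEET_ID_PATTERN.search(share_url)
    if match is None:
        raise ValueError(f"No sheet ID found in: {share_url}")
    return match.group(1)


share_url = (
    "https://docs.google.com/spreadsheets/d/"
    "16VVaWKbCMzzZMZum5BBLQ6jEHn-DAzIvd02NOO5ApP4/edit?usp=sharing"
)
sheet_id = extract_sheet_id(share_url)
print(sheet_id)
```

In a Zeppelin notebook this would go in a paragraph that starts with the %python interpreter directive.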

Next, use that ID to build a CSV export link in your Zeppelin notebook, then download and process the CSV.
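One way to do this is sketched below with the Python standard library. It assumes the sheet is shared so that anyone with the link can view it, and it uses Google's export?format=csv endpoint, which returns the first tab of the spreadsheet as CSV. The function names (csv_export_url, parse_csv, load_sheet) are hypothetical helpers for this example.

```python
import csv
import io
import urllib.request


def csv_export_url(sheet_id: str) -> str:
    """Build the CSV export link for a sheet ID."""
    return f"https://docs.google.com/spreadsheets/d/{sheet_id}/export?format=csv"


def parse_csv(text: str) -> list:
    """Parse CSV text into a list of dicts keyed by the header row."""
    return list(csv.DictReader(io.StringIO(text)))


def load_sheet(sheet_id: str) -> list:
    """Download the sheet as CSV and parse it (requires network access)."""
    with urllib.request.urlopen(csv_export_url(sheet_id)) as response:
        text = response.read().decode("utf-8")
    return parse_csv(text)


# In a notebook paragraph, with your own sheet ID:
# rows = load_sheet("16VVaWKbCMzzZMZum5BBLQ6jEHn-DAzIvd02NOO5ApP4")
```

Every value comes back as a string, so numeric columns need converting before you do any arithmetic on them.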

Finally, execute your notebook and run queries against the data as needed.
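As a rough idea of what such a query could look like, the sketch below averages one column grouped by another. The column names marital and balance and the sample rows are made up for illustration; substitute the headers from your own sheet.

```python
from collections import defaultdict


def average_by(rows, key, value):
    """Average the numeric `value` column for each distinct `key` value."""
    totals = defaultdict(lambda: [0.0, 0])  # key -> [running sum, row count]
    for row in rows:
        totals[row[key]][0] += float(row[value])
        totals[row[key]][1] += 1
    return {k: total / count for k, (total, count) in totals.items()}


# Made-up rows, shaped like the output of csv.DictReader (all values are strings).
sample_rows = [
    {"marital": "married", "balance": "100"},
    {"marital": "single", "balance": "300"},
    {"marital": "married", "balance": "200"},
]
print(average_by(sample_rows, "marital", "balance"))
```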