I wanted to process a small subset of data without spinning up a cluster, so I used the nagasuga/docker-hive
Docker image to run Hive on my Mac.
Running Hive
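A minimal sketch of how the Hive shell can be started with this image. The image name is the one mentioned above; the Hive install path inside the image (`/usr/local/hive`), the location of the CSV on the Mac, and the bind mount are assumptions, so check the image's README before relying on them.

```shell
#!/bin/sh
# Sketch only: HIVE_HOME and the CSV mount are assumptions, not confirmed
# by the image's documentation.
IMAGE="nagasuga/docker-hive"
HIVE_HOME="/usr/local/hive"             # assumed Hive install path in the image
CSV_ON_HOST="$PWD/resource-data.csv"    # hypothetical location of the CSV on the Mac
CSV_IN_CONTAINER="/resource-data.csv"   # path used inside the container in this post

# Start an interactive Hive shell when docker is installed and we are on a
# terminal; otherwise only print the command that would be run.
if command -v docker >/dev/null 2>&1 && [ -t 0 ]; then
  docker run --rm -it \
    -v "$CSV_ON_HOST:$CSV_IN_CONTAINER" \
    "$IMAGE" /bin/bash -c "cd $HIVE_HOME && ./bin/hive"
else
  echo "docker run --rm -it -v $CSV_ON_HOST:$CSV_IN_CONTAINER $IMAGE /bin/bash -c 'cd $HIVE_HOME && ./bin/hive'"
fi
```

The `--rm` flag discards the container on exit, which suits a throwaway session like this one; drop it if you want to keep the tables around.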
Once I was in the Hive shell, I created a table for my CSV data:
Creating the Table
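As a rough illustration, a table for comma-separated data could be defined as below. The table name `resources` and every column are hypothetical placeholders, since the real schema depends on what is in the CSV; only the `ROW FORMAT DELIMITED` clause is the part that matters for reading CSV files.

```shell
# Write a hypothetical table definition to a file.
# Assumption: table name and columns are placeholders; adjust them to
# match the columns of your own CSV.
cat > create_table.hql <<'EOF'
CREATE TABLE IF NOT EXISTS resources (
  id INT,
  name STRING,
  category STRING,
  amount DOUBLE
)
ROW FORMAT DELIMITED
FIELDS TERMINATED BY ','
STORED AS TEXTFILE;
EOF
```

The statement can be pasted at the `hive>` prompt, or run with `hive -f create_table.hql` once the file is available inside the container.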
Loading the Data
My CSV data is located at /resource-data.csv
on the container, and I load it into my table:
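A sketch of the load step, assuming a table named `resources` (a hypothetical name) already exists. `LOCAL` tells Hive to read the file from the container's local filesystem rather than from HDFS, which matches a file sitting at /resource-data.csv on the container.

```shell
# Write the load statement to a file.
# Assumption: `resources` is a placeholder table name.
cat > load_data.hql <<'EOF'
LOAD DATA LOCAL INPATH '/resource-data.csv' INTO TABLE resources;
EOF
```

Note that `INTO TABLE` appends to whatever is already in the table; `OVERWRITE INTO TABLE` replaces it instead.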
Querying the Data
Just two simple queries for demonstration:
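Two simple queries of this kind might look like the following. The table name `resources` and the `category` column are hypothetical, so swap in your own names.

```shell
# Write two illustrative queries to a file.
# Assumption: `resources` and `category` are placeholder names.
cat > queries.hql <<'EOF'
-- Peek at a few rows to confirm the load worked.
SELECT * FROM resources LIMIT 5;

-- Count rows per category.
SELECT category, COUNT(*) AS total
FROM resources
GROUP BY category;
EOF
```

The first query is a quick sanity check on the load; the second is an aggregation, which Hive runs as a full job rather than a simple fetch, so expect it to take noticeably longer.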
Resource:
Thanks to https://github.com/nagasuga/docker-hive
Thank You
Please feel free to show support by sharing this post, making a donation, subscribing, or reaching out to me if you want me to demo and write up any specific tech topic.
Thanks for reading!