I wanted to process a small subset of data, and not wanting to spin up a cluster, so I used
nagasuga/docker-hive docker image to run Hive on my Mac.
Once I was entered into my hive shell, I created a table for my CSV data:
Creating the Table
1 2 3 4 5 6 7 8 9 10
Loading the Data
My csv data is located at
/resource-data.csv on the container, which I will load into my table:
Query the Data
Just two simple queries for demonstration:
1 2 3 4 5 6 7 8 9 10 11
Thanks to https://github.com/nagasuga/docker-hive
Please feel free to show support by, sharing this post, making a donation, subscribing or reach out to me if you want me to demo and write up on any specific tech topic.
Thanks for reading!