How to convert a Parquet (gzip) file to a CSV for use in spreadsheet and GIS software.
1.0 Introduction
Urban SDK offers the ability to import Parquet (gzip) data files from Data Hub into Studio to combine data from multiple different Insights Reports, as well as export Parquet files from Studio into CSV format.
1.1 What is a Parquet (.gzip) file?
Parquet data format is a column-oriented format of data storage, as opposed to the row-based format that comma separated variable (CSV) files use. This means the data can be compressed into significantly smaller files sizes, as well as be read more efficiently to improve performance while reviewing Insights Reports.
File Size Comparison:
Location | Data | CSV file size | Parquet (gzip) file size |
Duval, County, Florida | Travel Time Index | 20 mb | 2.5 mb |
2.0 How to Convert a Parquet file to CSV
Use the following steps to import your Data Hub Parquet files into a Studio map and download it as a CSV.
Step 1 - Open Studio and Click Add Data
Navigate to Studio in the left navigation panel. The icon looks like a map. Once Studio is open click the Add Data button (shown below) to open the Data Import Wizard.
Step 2 - Import from Data Hub
In the Data Wizard select Import from Data Hub to choose which files from Data Hub you would like to import into Studio.
Step 3 - Download Data as CSV from Studio
Once the data has loaded into Studio, click the blue "Download" button at the bottom of the Map Layer panel and select "CSV" this will enable you to download the file as a CSV.