Converting a Parquet (gzip) file to CSV

How to convert a Parquet (gzip) file to a CSV for use in spreadsheet and GIS software.

1.0 Introduction

Urban SDK lets you import Parquet (gzip) data files from Data Hub into Studio to combine data from multiple Insights Reports, and then export that data from Studio in CSV format.

1.1 What is a Parquet (.gzip) file?

Parquet is a column-oriented data storage format, as opposed to the row-based format that comma-separated values (CSV) files use. This means the data can be compressed into significantly smaller file sizes and read more efficiently, which improves performance while reviewing Insights Reports.

File Size Comparison:

Location              | Data              | CSV file size | Parquet (gzip) file size
Duval County, Florida | Travel Time Index | 20 MB         | 2.5 MB
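
To illustrate why a column-oriented layout tends to compress well, here is a minimal Python sketch. It is only a conceptual illustration, not how Parquet is actually encoded: the sample records, field names, and values are made up, and plain gzip stands in for Parquet's per-column compression. It stores the same records once row by row (as a CSV would) and once column by column, then prints the gzip-compressed size of each.

```python
import gzip

# Hypothetical sample records (not real Urban SDK data):
# (county, road_id, travel_time_index)
rows = [
    ("Duval", road_id, round(1.0 + (road_id % 7) * 0.05, 2))
    for road_id in range(1000)
]

# Row-oriented layout, as in a CSV: one record per line, fields interleaved.
row_bytes = "\n".join(f"{c},{r},{t}" for c, r, t in rows).encode()

# Column-oriented layout: all values of one column are stored together,
# so similar values sit next to each other.
columns = list(zip(*rows))
col_bytes = "\n".join(",".join(str(v) for v in col) for col in columns).encode()

# Compress both layouts with gzip and compare the resulting sizes.
row_gz = len(gzip.compress(row_bytes))
col_gz = len(gzip.compress(col_bytes))

print(f"uncompressed: rows={len(row_bytes)} bytes, columns={len(col_bytes)} bytes")
print(f"gzip:         rows={row_gz} bytes, columns={col_gz} bytes")
```

Both layouts hold exactly the same values, so any difference in compressed size comes from how the values are arranged. The actual savings depend on the data, as the file size comparison above shows for one report.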

 


2.0 How to Convert a Parquet file to CSV

Use the following steps to import your Data Hub Parquet files into a Studio map and download them as a CSV.

Step 1 - Open Studio and Click Add Data

Navigate to Studio in the left navigation panel. The icon looks like a map. Once Studio is open, click the Add Data button (shown below) to open the Data Import Wizard.


Step 2 - Import from Data Hub

In the Data Import Wizard, select Import from Data Hub and choose the files from Data Hub that you would like to import into Studio.


Step 3 - Download Data as CSV from Studio

Once the data has loaded into Studio, click the blue "Download" button at the bottom of the Map Layer panel and select "CSV". This downloads the file as a CSV.
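
If you already have the Parquet file on your computer and prefer a script, the same conversion can be sketched in Python. This is not an Urban SDK feature, only a minimal example under a few assumptions: pandas and a Parquet engine such as pyarrow are installed, the file is a regular Parquet file that uses gzip compression internally (which pandas reads directly), and the helper names csv_path_for and parquet_to_csv are hypothetical.

```python
from pathlib import Path


def csv_path_for(parquet_path: str) -> Path:
    """Return the output path: same folder and base name, with a .csv extension."""
    path = Path(parquet_path)
    name = path.name
    # Strip common Parquet/gzip suffixes, e.g. "report.parquet.gzip" -> "report".
    for suffix in (".gzip", ".gz", ".parquet"):
        if name.endswith(suffix):
            name = name[: -len(suffix)]
    return path.with_name(name + ".csv")


def parquet_to_csv(parquet_path: str) -> Path:
    """Read a Parquet file and write its contents next to it as a CSV."""
    # Imported here so the path helper above works even without pandas installed.
    import pandas as pd  # assumption: pandas plus pyarrow (or fastparquet)

    df = pd.read_parquet(parquet_path)
    out_path = csv_path_for(parquet_path)
    df.to_csv(out_path, index=False)  # index=False omits the row-number column
    return out_path
```

For example, calling parquet_to_csv("travel_time_index.parquet.gzip") (a made-up file name) would write travel_time_index.csv to the same folder, ready to open in spreadsheet or GIS software.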