Convert CSV to Arrow

Convert CSV to Arrow in seconds with this free online CSV to Arrow converter

Accepts csv


CSV

CSV (Comma Separated Values) files are the most common format for storing tabular data. Values in a row are separated by commas and rows are separated by newlines.

CSQ files often start with a header row that has column names, but this is not required.

Each row in a CSV file must have the same number of values as the header row.

CSV files do no enforce types or a schema. This means that each column can have multiple types, which can make analysis difficult and compression inefficient.

Parquet files can be easier to analyze and compress better than CSV files.

Arrow

Apache Arrow (.arrow) is a format that was designed for storing tabular data in memory (RAM). It was not designed for storing data as files on disk.

Arrow enforces a schema. Which means that every value in a column must have the same value.

Arrow was designed to work well between different data analysis systems without needing to be serialised or deserialized.

Arrow is a binary data format, which means that it can be easily read by computers. But it cannot be read by people. The easiest way to view arrow data is to convert it to CSV first.

It can be best to convert Arrow data to Parquet before saving to disk. Arrow was not designed for storing data to disk, so can be inefficient and slow to query.


Supercharge your data exploration

Open csv, parquet, arrow, json and tsv files straight from your desktop

Or


Share and embed

Share your graphs and data sets.

Share your graphs and data sets. Or embed them directly into web pages.


Work straight from Google Drive

Open csv, parquet, arrow, json and tsv files directly from Drive, Gmail and Classroom by installing the Google Workspace App


How to Convert CSV to Arrow

Viewing converted data
  1. Select your input CSV file
  2. Your CSV file will be converted to Arrow
  3. Download your Arrow file
  4. Click the View button to view your file

How to Convert CSV to Arrow in Python

We can convert CSV to Arrow in Python using Pandas or DuckDB

How to Convert CSV to Arrow using Pandas

First, we need to install pandas

pip install pandas

Then we can load the CSV file into a dataframe

df = pd.read_csv('path/to/file.csv')

Finally, we can export the dataframe to the Arrow format

df.to_feather('path/to/file.arrow', index=False)

How to Convert CSV to Arrow using DuckDB

First, we need to install duckdb for Python

 pip install duckdb

The following DuckDB query will read a CSV file and output a Arrow file

duckdb.sql("""COPY (select * from 'path/to/file.csv') TO 'path/to/file.arrow' (FORMAT 'arrow')""")