S3 Select
S3 Select is an S3 feature that allows you to operate on JSON, CSV, and Parquet files in a row-based manner using SQL syntax. QUILT features experimental support for S3 Select queries as part of the Bucket
interface:
$ python
>>> import quilt3
>>> b = quilt3.Bucket("s3://alpha-quilt-example")
>>> b.select("foo/bar.csv", "SELECT * FROM S3Object LIMIT 5")
<<< <pandas DataFrame object at ...>
Last updated
Was this helpful?