LogoLogo
HomeGitHub RepoBook Demo
version-3.1.10
version-3.1.10
  • Introduction
  • Installation
  • Quickstart
  • Walkthrough
    • Editing a Package
    • Uploading a Package
    • Installing a Package
    • Getting Data from a Package
    • Working with the Catalog
    • Working with a Bucket
  • Advanced Usage
    • Filtering a Package
    • .quiltignore
    • Materialization
    • Working with Manifests
    • S3 Select
  • API Reference
    • quilt3
    • quilt3.Package
    • quilt3.Bucket
    • quilt3 CLI
  • References
    • Frequently Asked Questions
    • Technical Reference
    • Contributing
    • Further Reading
Powered by GitBook
On this page
  • Quilt is a versioned data portal for AWS
  • Who is Quilt for?
  • What does Quilt do?
  • How does Quilt work?
  • Use cases
  • Roadmap

Was this helpful?

Introduction

NextInstallation

Last updated 5 years ago

Was this helpful?

Below is the documentation for . See and from Quilt 2.

Quilt is a versioned data portal for AWS

  • is a petabyte-scale open data portal that runs on Quilt

  • includes case studies, use cases, videos, and information on how you can run a private Quilt instance

Who is Quilt for?

Quilt is for data-driven teams of both technical and non-technical members (executives, data scientists, data engineers, sales, product, etc.).

What does Quilt do?

Quilt adds search, visual content preview, and versioning to every file in S3.

How does Quilt work?

Quilt consists of a Python client, web catalog, lambda functions—all of which are open source—plus a suite of backend services and Docker containers orchestrated by CloudFormation. The latter are available under a paid license for private use on .

Use cases

Quilt addresses five key use cases:

  • Share data at scale. Quilt wraps AWS S3 to add simple URLs, web preview for large files, and sharing via email address (no need to create an IAM role).

  • Understand data better through inline documentation (Jupyter notebooks, markdown) and visualizations (Vega, Vega Lite)

  • Discover related data by indexing objects in ElasticSearch

  • Model data by providing a home for large data and models that don't fit in git, and by providing immutable versions for objects and data sets (a.k.a. "Quilt Packages")

  • Decide by broadening data access within the organization and supporting the documentation of decision processes through audit-able versioning and inline documentation

Roadmap

I - Performance and core services

II - CI/CD for data

III - Storage agnostic (support Azure, GCP buckets)

IV - Cloud agnostic

Quilt 3
here
here
open.quiltdata.com
quiltdata.com
quiltdata.com
docs on_gitbook
chat on_slack
codecov
pypi