Quick Start

Get up and running with Quilt in minutes! This guide provides multiple learning paths based on your experience level and preferred learning style.

🚀 Choose Your Learning Path

👨‍💻 For Developers - Hands-on Python Tutorial

Start coding immediately with our interactive Python tutorial:

📺 For Visual Learners - Video Tutorials

Watch comprehensive video guides:

  • Complete Video Series - How to work with S3 datasets using Quilt

  • Duration: ~30 minutes total

  • Topics: Installation, basic operations, data versioning, collaboration

📊 For Data Scientists - Real Dataset Exploration

Explore production datasets with guided examples:

⚡ 5-Minute Quick Start

1. Install Quilt

pip install quilt3

2. Browse Public Data

import quilt3

# Browse available datasets
packages = list(quilt3.list_packages("s3://quilt-example"))
print(f"Found {len(packages)} public datasets")

# Load a sample dataset
pkg = quilt3.Package.browse("examples/hurdat", "s3://quilt-example")
print(pkg)

3. Access Your First File

# Download and read a file (using pkg from previous step)
data_file = pkg["README_NF_QUILT.md"]
content = data_file.get()
print(content)

4. Create Your First Package

import quilt3
import tempfile
import os

# Create a temporary file
with tempfile.NamedTemporaryFile(mode='w', delete=False, suffix='.txt') as f:
    f.write("Hello, Quilt!")
    temp_file = f.name

# Create a new package
new_pkg = quilt3.Package()
new_pkg.set("my_data.txt", temp_file)
new_pkg.set_meta({"description": "My first Quilt package"})

# Clean up
os.unlink(temp_file)

# Note: Pushing requires S3 credentials, so we'll just show the package
print(f"Package created with {len(new_pkg)} files")

🎯 Next Steps

Beginner Path

  1. ✅ Complete the 5-minute quick start above

  2. 📖 Read the Mental Model to understand Quilt concepts

  3. 🔧 Follow the Installation Guide for your environment

  4. 📝 Try the Basic Workflows

Intermediate Path

  1. 🏗️ Set up your AWS Integration

  2. 👥 Configure Team Collaboration

  3. 🔍 Learn Advanced Search

  4. 📊 Explore Data Visualization

Advanced Path

  1. 🔐 Configure Cross-Account Access

  2. 🤖 Implement Automated Workflows

  3. 🔧 Use the Admin API

🌐 Explore Open Data

Discover publicly available datasets:

  • Open Quilt Data Portal - Browse hundreds of public datasets

  • Featured Collections: COVID-19 research, climate data, genomics, financial datasets

  • No registration required - Start exploring immediately

💡 Common Use Cases

Data Science Teams

  • Version control for datasets and models

  • Reproducible research and experiments

  • Collaborative data exploration

ML/AI Development

  • Dataset versioning for model training

  • Experiment tracking and comparison

  • Model artifact management

Enterprise Data Management

  • Centralized data catalog

  • Data governance and compliance

  • Cross-team data sharing

Research Organizations

  • Research data management

  • Publication-ready data packages

  • Long-term data preservation

🆘 Need Help?


Ready to dive deeper? Continue with the Mental Model to understand how Quilt organizes and manages your data.

Last updated

Was this helpful?