Quick Start

Get up and running with Quilt in minutes! This guide provides multiple learning paths based on your experience level and preferred learning style.

🚀 Choose Your Learning Path

👨‍💻 For Developers - Hands-on Python Tutorial

Start coding immediately with our interactive Python tutorial:

Interactive Python Tutorial - Learn quilt3 through practical examples

📺 For Visual Learners - Video Tutorials

Watch comprehensive video guides:

Complete Video Series - How to work with S3 datasets using Quilt
Duration: ~30 minutes total
Topics: Installation, basic operations, data versioning, collaboration

📊 For Data Scientists - Real Dataset Exploration

Explore production datasets with guided examples:

CORD-19 Dataset Analysis - Real-world COVID research data exploration
Machine Learning with PyTorch - Versioning data and models for rapid ML experimentation

⚡ 5-Minute Quick Start

1. Install Quilt

pip install quilt3

2. Browse Public Data

import quilt3

# Browse available datasets
packages = list(quilt3.list_packages("s3://quilt-example"))
print(f"Found {len(packages)} public datasets")

# Load a sample dataset
pkg = quilt3.Package.browse("examples/hurdat", "s3://quilt-example")
print(pkg)

3. Access Your First File

# Download and read a file (using pkg from previous step)
data_file = pkg["README_NF_QUILT.md"]
content = data_file.get()
print(content)

4. Create Your First Package

import quilt3
import tempfile
import os

# Create a temporary file
with tempfile.NamedTemporaryFile(mode='w', delete=False, suffix='.txt') as f:
    f.write("Hello, Quilt!")
    temp_file = f.name

# Create a new package
new_pkg = quilt3.Package()
new_pkg.set("my_data.txt", temp_file)
new_pkg.set_meta({"description": "My first Quilt package"})

# Clean up
os.unlink(temp_file)

# Note: Pushing requires S3 credentials, so we'll just show the package
print(f"Package created with {len(new_pkg)} files")

🎯 Next Steps

Beginner Path

✅ Complete the 5-minute quick start above
📖 Read the Mental Model to understand Quilt concepts
🔧 Follow the Installation Guide for your environment
📝 Try the Basic Workflows

Intermediate Path

🏗️ Set up your AWS Integration
👥 Configure Team Collaboration
🔍 Learn Advanced Search
📊 Explore Data Visualization

Advanced Path

🔐 Configure Cross-Account Access
⚡ Set up EventBridge Integration
🤖 Implement Automated Workflows
🔧 Use the Admin API

🌐 Explore Open Data

Discover publicly available datasets:

Open Quilt Data Portal - Browse hundreds of public datasets
Featured Collections: COVID-19 research, climate data, genomics, financial datasets
No registration required - Start exploring immediately

💡 Common Use Cases

Data Science Teams

Version control for datasets and models
Reproducible research and experiments
Collaborative data exploration

ML/AI Development

Dataset versioning for model training
Experiment tracking and comparison
Model artifact management

Enterprise Data Management

Centralized data catalog
Data governance and compliance
Cross-team data sharing

Research Organizations

Research data management
Publication-ready data packages
Long-term data preservation

🆘 Need Help?

📖 Documentation: Browse the full Quilt Documentation
💬 Community: Join our Slack Community
🐛 Issues: Report bugs on GitHub
📧 Support: Contact [email protected]

Ready to dive deeper? Continue with the Mental Model to understand how Quilt organizes and manages your data.

PreviousInstallation NextEditing a Package

Last updated 4 months ago

Was this helpful?