AWS Glue

A serverless data integration service.

Visit Website →

Overview

AWS Glue is a fully managed extract, transform, and load (ETL) service that makes it simple and cost-effective to categorize your data, clean it, enrich it, and move it reliably between various data stores and data streams. AWS Glue consists of a central metadata repository known as the AWS Glue Data Catalog, an ETL engine that automatically generates Python or Scala code, and a flexible scheduler that handles dependency resolution, job monitoring, and retries.

✨ Key Features

  • Serverless ETL
  • Automatic schema discovery (crawlers)
  • Integrated data catalog
  • Visual and code-based job authoring
  • Job scheduling and orchestration

🎯 Key Differentiators

  • Deep integration with the AWS ecosystem
  • Serverless architecture
  • Pay-as-you-go pricing

Unique Value: AWS Glue simplifies and automates the process of data integration on AWS, allowing customers to build and manage data pipelines with ease.

🎯 Use Cases (4)

ETL/ELT pipelines Data preparation for analytics Building a data lake Streaming data integration

✅ Best For

  • Building serverless ETL jobs to process data in Amazon S3
  • Creating and managing a data catalog for a data lake on AWS

💡 Check With Vendor

Verify these considerations match your specific requirements:

  • Complex, multi-cloud or hybrid-cloud orchestration.
  • Workflows that are not primarily data integration tasks.

🏆 Alternatives

Azure Data Factory Google Cloud Data Fusion Talend

Compared to self-managing ETL infrastructure, AWS Glue is more cost-effective, scalable, and easier to manage.

💻 Platforms

Web API

🔌 Integrations

Amazon S3 Amazon Redshift Amazon RDS Amazon DynamoDB JDBC-accessible databases

🛟 Support Options

  • ✓ Email Support
  • ✓ Live Chat
  • ✓ Phone Support
  • ✓ Dedicated Support (AWS Support Plans tier)

🔒 Compliance & Security

✓ SOC 2 ✓ HIPAA ✓ BAA Available ✓ GDPR ✓ ISO 27001 ✓ SSO ✓ SOC 1, 2, 3 ✓ HIPAA ✓ GDPR ✓ ISO/IEC 27001, 27017, 27018 ✓ PCI DSS Level 1

💰 Pricing

Contact for pricing
Free Tier Available

Free tier: Free tier for the Data Catalog and crawlers.

Visit AWS Glue Website →