AWSGlue

aws-glue

Catalog-Storage

AWS Glue Data Catalog is the metadata repository for for all of your data assets across various data sources. When using Data Catalog, you are charged for storing and accessing table metadata. This billing code represents the fee associated with Catalog Storage. Storage is free for the first million objects stored then there is a fee per 100,000 objects stored beyond that.

Catalog-Request

AWS Glue Data Catalog is the metadata repository for for all of your data assets across various data sources. When using Data Catalog, you are charged for storing and accessing table metadata. This billing code represents the fee associated with Catalog Metadata requests. Requests are free for the first million requests stored then there is a fee per million requests made beyond that.

Crawler-DPU-Hour

There is an hourly rate for AWS Glue crawler runtime to discover data and populate the AWS Glue Data Catalog. You are charged an hourly rate based on the number of Data Processing Units (or DPUs) used to run your crawler. A single DPU provides 4 vCPU and 16 GB of memory. You are billed in increments of 1 second, rounded up to the nearest second, with a 10-minute minimum duration for each crawl.

ETL-DPU-Hour

This billing code applies to standard AWS Glue ETL jobs, which extract, transform, and load data between sources. Charges are based on the number of DPUs allocated to the job and the time it runs. Billing increments are measured in seconds, with a 1-minute minimum per job.

ETL-Flex-DPU-Hour

AWS Glue Flex is a flexible pricing model for ETL (Extract, Transform, Load) jobs that allows for fine-grained billing. You are charged for the exact DPU usage during the job's execution time rather than a fixed hourly rate. This code represents the cost associated with Flex jobs, offering cost savings for short-duration or intermittent jobs.

DBrew-Node-Hour

AWS Glue DataBrew is a visual tool that helps data analysts and scientists clean, normalize, and transform data for analytics and machine learning. With DataBrew, you only pay for the time that you use to clean and normalize data when you are running the jobs. You are charged an hourly rate based on the number of DataBrew nodes used to run your job. By default, DataBrew allocates 5 nodes to each job. DataBrew jobs have a 1-minute billing duration and have a rate set per hour.

GlueInteractiveSession-DPU-Hour

This billing code applies to AWS Glue Interactive Sessions, which allow developers to perform interactive data preparation and analytics using notebooks or REPL environments. Charges are based on the number of DPUs allocated to the session, multiplied by the duration it runs, with billing increments of 1 second and a 10-minute minimum duration.

DEVED-DPU-Hour

This billing code represents DPU hours consumed in development endpoints, which are interactive environments for developers to test and debug AWS Glue scripts. Charges are based on the number of DPUs provisioned for the endpoint and the duration of use.

Optimization-DPU-Hour

This billing code applies to the DPU hours used during the data optimization phase in AWS Glue ETL jobs. Optimization tasks include reorganizing, partitioning, or compressing data to improve query performance and reduce storage costs.

MemOptimized-DPU-Hour

AWS Glue jobs requiring memory-intensive operations use memory-optimized DPUs. This billing code reflects the cost associated with such DPUs, which provide enhanced memory capacity for demanding ETL workloads.

DBrew-FreeSessions

A DataBrew interactive session refers to a live, interactive workspace within the AWS Glue DataBrew tool where users can visually explore, clean, and transform data from various sources without writing code, essentially allowing them to perform data preparation tasks in a real-time, hands-on manner using a graphical interface; each active session within a DataBrew project is considered a separate unit for billing purposes. The first 40 interactive sessions are free for first-time users of DataBrew and this billing code is representative of that.

Column-Statistics-DPU-Hour

AWS Glue computes column-level statistics for datasets as part of its data profiling and transformation processes. This billing code represents the DPU hours used specifically for calculating column statistics during these tasks, enabling insights into data distributions and quality.

DBrew-Sessions

A DataBrew interactive session refers to a live, interactive workspace within the AWS Glue DataBrew tool where users can visually explore, clean, and transform data from various sources without writing code, essentially allowing them to perform data preparation tasks in a real-time, hands-on manner using a graphical interface; each active session within a DataBrew project is considered a separate unit for billing purposes. A session is initiated after the data in a AWS Glue DataBrew project is loaded. You are billed for the total number of the sessions used, which are calculated in 30-minute increments.