• Skip to main content
  • Skip to secondary menu
  • Skip to footer

Analysis.org

Intelligence Analysis in Market Context

  • Sponsored Post
  • Market Research Reports
    • Technology Analysis
  • About
  • Contact

AWS Announces AWS Glue DataBrew

November 11, 2020 By Analysis.org

New visual data preparation tool for AWS Glue enables data scientists and data analysts to clean and normalize data up to 80% faster than traditional approaches to data preparation

NTT DOCOMO, bp, and INVISTA among customers using AWS Glue DataBrew

Today, Amazon Web Services, Inc. (AWS), an Amazon.com, Inc. company (NASDAQ:AMZN) announced the general availability of AWS Glue DataBrew, a new visual data preparation tool that enables customers to clean and normalize data without writing code. Since 2016, data engineers have used AWS Glue to create, run, and monitor extract, transform, and load (ETL) jobs. AWS Glue provides both code-based and visual interfaces, and has dramatically simplified extracting, orchestrating, and loading data in the cloud for customers. Data analysts and data scientists have wanted an easier way to clean and transform this data, and that’s what DataBrew delivers, with a service that allows data exploration and experimentation directly from AWS data lakes, data warehouses, and databases without writing code. AWS Glue DataBrew offers customers over 250 pre-built transformations to automate data preparation tasks (e.g. filtering anomalies, standardizing formats, and correcting invalid values) that would otherwise require days or weeks writing hand-coded transformations. Once the data is prepared, customers can immediately start using it with AWS and third-party analytics and machine learning services to query the data and train machine learning models. There are no upfront commitments or costs to use AWS Glue DataBrew, and customers only pay for creating and running transformations on datasets. To get started, visit https://aws.amazon.com/glue/features/databrew.

Preparing data for analytics and machine learning involves several necessary and time-consuming tasks, including data extraction, cleaning, normalization, loading, and the orchestration of ETL workflows at scale. For extracting, orchestrating, and loading data at scale, data engineers and ETL developers skilled in SQL or programming languages like Python or Scala can use AWS Glue. ETL developers often prefer the visual interfaces common in modern ETL tools over writing SQL, Python, or Scala, so AWS recently introduced AWS Glue Studio, a new visual interface to help author, run, and monitor ETL jobs without having to write any code. Once the data has been reliably moved, the underlying data still needs to be cleaned and normalized by data analysts and data scientists that operate in the lines of business and understand the context of the data. To clean and normalize the data, data analysts and data scientists have to either work with small batches of the data in Excel or Jupyter Notebooks, which cannot accommodate large data sets, or rely on scarce data engineers and ETL developers to write custom code to perform cleaning and normalization transformations. In an effort to spot anomalies in the data, highly skilled data engineers and ETL developers spend days or weeks writing custom workflows to pull data from different sources, then pivot, transpose, and slice the data multiple times, before they can iterate with data analysts or data scientists to identify and fix data quality issues. After they have developed these transformations, data engineers and ETL developers still need to schedule the custom workflows to run on an ongoing basis, so new incoming data can automatically be cleaned and normalized. Each time a data analyst or data scientist wants to change or add a transformation, the data engineers and ETL developers need to extract, load, clean, normalize, and orchestrate the data preparation tasks over again. This iterative process can take several weeks to months to complete; and as a result, customers spend as much as 80% of their time cleaning and normalizing data instead of actually analyzing the data and extracting value from it.

AWS Glue DataBrew is a visual data preparation tool for AWS Glue that allows data analysts and data scientists to clean and transform data with an interactive, point-and-click visual interface, without writing any code. With AWS Glue DataBrew end users can easily access and visually explore any amount of data across their organization directly from their Amazon Simple Storage Service (S3) data lake, Amazon Redshift data warehouse, and Amazon Aurora and Amazon Relational Database Service (RDS) databases. Customers can choose from over 250 built-in functions to combine, pivot, and transpose the data without writing code. AWS Glue DataBrew recommends data cleaning and normalization steps like filtering anomalies, normalizing data to standard date and time values, generating aggregates for analyses, and correcting invalid, misclassified, or duplicative data. For complex tasks like converting words to a common base or root word (e.g. converting “yearly” and “yearlong” to “year”), AWS Glue DataBrew also provides transformations that use advanced machine learning techniques like Natural Language Processing (NLP). Users can then save these cleaning and normalization steps into a workflow (called a recipe) and apply them automatically to future incoming data. If changes need to be made to the workflow, data analysts and data scientists simply update the cleaning and normalization steps in the recipe, and they are automatically applied to new data as it arrives. AWS Glue DataBrew publishes the prepared data to Amazon S3, which makes it easy for customers to immediately use it in analytics and machine learning. AWS Glue DataBrew is serverless and fully managed, so customers never need to configure, provision, or manage any compute resources.

“AWS customers are using data for analytics and machine learning at an unprecedented pace. However, these customers regularly tell us that their teams spend too much time on the undifferentiated, repetitive, and mundane tasks associated with data preparation,” said Raju Gulabani, VP of Database and Analytics, AWS. “Customers love the scalability and flexibility of code-based data preparation services like AWS Glue, but they could also benefit from allowing business users, data analysts, and data scientists to visually explore and experiment with data independently, without writing code. AWS Glue DataBrew features an easy-to-use visual interface that helps data analysts and data scientists of all technical levels understand, combine, clean, and transform data.”

AWS Glue DataBrew is generally available today in US East (N. Virginia), US East (Ohio), US West (Oregon), EU (Ireland), EU (Frankfurt), Asia Pacific (Sydney), and Asia Pacific (Tokyo), with availability in additional regions coming soon.

Tokyo-based NTT DOCOMO is the largest mobile service provider in Japan, serving more than 80 million customers. “Our analysts profile and query various kinds of structured and unstructured data in order to better understand usage patterns,” said Takashi Ito, General Manager of Marketing Platform Planning Department, NTT DOCOMO. “AWS Glue DataBrew provides a visual interface that enables both our technical and non-technical users to analyze data quickly and easily. Its advanced data profiling capability helps us better understand our data and monitor the data quality. AWS Glue DataBrew and other AWS analytics services have allowed us to streamline our workflow and increase productivity.”

bp is one of the world’s largest integrated energy companies. “A data lake is a critical part of our analytics strategy. One of the challenges we face is not being able to easily explore data before ingestion into our data lake,” said John Maio, Director, Data & Analytics Platforms Architecture, bp. “AWS Glue DataBrew has sophisticated data profiling functionality and a rich set of built-in transformations. This enables our data engineers to easily explore new datasets in a visual interface and make modifications in order to optimize ingestion and allow analysts to shape the data for their analytics solutions. We see AWS Glue DataBrew as a way to help us better manage our data platform and improve efficiencies in our data pipelines.”

INVISTA, a subsidiary of Koch Industries, is one of the world’s largest integrated producers of chemical intermediates, polymers, and fibers. “Data is critical to optimizing our manufacturing processes. One of the challenges we face is ensuring we have a clean data lake that can serve as the source of truth for our analytics and machine learning applications,” said Tanner Gonzalez, Analytics and Cloud leader, INVISTA. “The data ingested into our data lake often contains duplicate values, incorrect formatting and other imperfections that make it difficult to use in its raw form. Amazon AWS Glue DataBrew will allow our data analysts to visually inspect large data sets, clean and enrich data, and perform advanced transformations. AWS Glue DataBrew will empower our analysts and data scientists to perform advanced data engineering activities, giving them the freedom to explore their data and decreasing the time to derive new insights.”

About Amazon Web Services
For 14 years, Amazon Web Services has been the world’s most comprehensive and broadly adopted cloud platform. AWS offers over 175 fully featured services for compute, storage, databases, networking, analytics, robotics, machine learning and artificial intelligence (AI), Internet of Things (IoT), mobile, security, hybrid, virtual and augmented reality (VR and AR), media, and application development, deployment, and management from 77 Availability Zones (AZs) within 24 geographic regions, with announced plans for 15 more Availability Zones and five more AWS Regions in India, Indonesia, Japan, Spain, and Switzerland. Millions of customers—including the fastest-growing startups, largest enterprises, and leading government agencies—trust AWS to power their infrastructure, become more agile, and lower costs. To learn more about AWS, visit aws.amazon.com.

About Amazon
Amazon is guided by four principles: customer obsession rather than competitor focus, passion for invention, commitment to operational excellence, and long-term thinking. Customer reviews, 1-Click shopping, personalized recommendations, Prime, Fulfillment by Amazon, AWS, Kindle Direct Publishing, Kindle, Fire tablets, Fire TV, Amazon Echo, and Alexa are some of the products and services pioneered by Amazon. For more information, visit www.amazon.com/about and follow @AmazonNews.

Filed Under: Briefing

Footer

Recent Posts

  • Apple Delivers a Power Quarter as Growth Reaccelerates Across the Board
  • PayPal’s Reset Moment Feels Less Like a Shuffle and More Like a Bet on Focus
  • Reading the PEG Ratio Across Nvidia, Broadcom, and AMD
  • Nvidia’s $5 Trillion Is Earned, Not Borrowed
  • Taiwan Overtakes UK as World’s 7th-Largest Stock Market
  • Intel Q1 2026: Recovery Signals Strengthen, but the Turnaround Is Still Unfinished
  • Yuan Gains Ground, But the Dollar Still Dominates
  • MongoDB Expands Irish Operations with €74 Million Investment in AI and Engineering Growth
  • ServiceNow Q1 2026: The AI Control Tower Thesis Is Holding
  • Adobe’s $25 Billion Buyback Is a Bet on Itself

Media Partners

  • Market Analysis
  • k4i.com
  • Market Research Media
The Bill Comes Due
The Software-Defined Camera Won. The Open OS Did Not.
Cars Are Computers Now, and Most Carmakers Aren’t
Gartner: Global IT Spending to Hit $6.31 Trillion in 2026, Driven by AI Infrastructure
The SDK Generator Benchmarks: Infrastructure vs. Convenience
Infographic: We Are Likely in the Early Stages of Another Productivity Boom
Infographic: Establishing the National Multimodal Freight Network
Global WiFi Market: Size, Segmentation, Trends, and Forecast to 2030
Synera’s $40M Series B: What the Press Release Isn’t Saying
Amazon’s Globalstar Acquisition Is a Spectrum War Dressed as a Satellite Deal
Mistral Is Building the U.S. Gateway for Israeli Autonomous Weapons
Andon Market: The AI Agent Retail Experiment
ASML Accelerates EUV Production Amid AI Chip Demand
Cyera Acquires Ryft for $100M–$130M
Genki Robotics Reaches $1 Billion Valuation
Google's AI Compute Duopoly
GPT-5.4 Solves the Erdős Problem
Palantir's Civil Liberties Crisis
SS7 and Diameter Vulnerabilities Enable State Surveillance
815 Security Violations, 1,032 Open Vulnerabilities: Inside DCSA's FY2025 Compliance Data
China’s U.S. Treasury Holdings: The Great Repositioning (2021–2025)
Infographic: Why the 2025 CIPA Data Proves the APS-C Renaissance is Real
How WiFi Changed Media
Canva Acquires Simtheory and Ortto to Build End-to-End Work Platform
Netflix Price Hikes, The Economics of Dominance in a Saturated Streaming Market
America’s Brands Keep Winning Even as America Itself Slips
Kioxia’s Storage Gambit: Flash Steps Into the AI Memory Hierarchy
Mamdani Strangling New York
The Rise of Faceless Creators: Picsart Launches Persona and Storyline for AI Character-Driven Content
Apple TV Arrives on The Roku Channel, Expanding the Streaming Platform Wars

Media Partners

  • 3V.org
  • Referently.com
  • Media Presser
Berkshire Hathaway's Annual Meeting Without Warren Buffett
Canelo vs. Benavidez: The Fight Boxing Spent Years Avoiding
Elon Musk's Nvidia Comments and the Market Attention Problem
Generation Z in the Labor Market: What the Data Actually Shows
Harley-Davidson's 2024–2026 Recall and What It Signals
Joel Embiid and the Injury Question That Never Goes Away
Kentucky Derby 2026: What the Result Tells You
Miami Grand Prix 2026 and the American F1 Calculus
Pete Hegseth and the Pentagon's Leadership Vacuum
Sam Altman, xAI, and the AI Industry's Accountability Deficit
Sponsored Post
About
Contact
Photo of the Day: Burano Canal in Winter Light
What Is Travel Tech?
60 GHz WiGig Is Not Dead: Here Is Where It Actually Makes Sense
802.11r, 802.11k, 802.11v: The Three Protocols That Make WiFi Roaming Seamless
HaLow (802.11ah): The Sub-1 GHz WiFi Standard Built for IoT That Nobody Talks About
How Enterprise WiFi Authentication Actually Works: 802.1X and RADIUS Explained
How to Read Your WiFi Signal Strength: What dBm Numbers Actually Mean
China Has Shed $357 Billion in U.S. Treasuries Since 2021
Foreign Debt Holdings Are a Trade Deficit Problem, Not Just a Fiscal One
Foreign Holdings of U.S. Federal Debt Reached $9.2 Trillion in 2025
Japan Holds $1.185 Trillion in U.S. Debt and the Number Tells an Incomplete Story
NAB 2026: Las Vegas and the End of the Broadcast Era
Private Investors Now Dominate Foreign Holdings of U.S. Treasury Debt
The United States Paid $282 Billion in Interest to Foreign Debt Holders in 2025
Why Belgium Holds More U.S. Debt Than Saudi Arabia, and What That Actually Means
Biometric Technologies and Congress: Recent Legislation and Open Questions
Biometric Technologies and Global Security: An Overview

Copyright © 2026 Analysis.org

Media Partners: Technologies · Market Analysis · Market Research · Exclusive Domains · Photography

We use cookies on our website to give you the most relevant experience by remembering your preferences and repeat visits. By clicking “Accept”, you consent to the use of ALL the cookies.
Do not sell my personal information.
Cookie SettingsAccept
Manage consent

Privacy Overview

This website uses cookies to improve your experience while you navigate through the website. Out of these, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may affect your browsing experience.
Necessary
Always Enabled
Necessary cookies are absolutely essential for the website to function properly. These cookies ensure basic functionalities and security features of the website, anonymously.
CookieDurationDescription
cookielawinfo-checkbox-analytics11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional11 monthsThe cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy11 monthsThe cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
Functional
Functional cookies help to perform certain functionalities like sharing the content of the website on social media platforms, collect feedbacks, and other third-party features.
Performance
Performance cookies are used to understand and analyze the key performance indexes of the website which helps in delivering a better user experience for the visitors.
Analytics
Analytical cookies are used to understand how visitors interact with the website. These cookies help provide information on metrics the number of visitors, bounce rate, traffic source, etc.
Advertisement
Advertisement cookies are used to provide visitors with relevant ads and marketing campaigns. These cookies track visitors across websites and collect information to provide customized ads.
Others
Other uncategorized cookies are those that are being analyzed and have not been classified into a category as yet.
SAVE & ACCEPT