AWS Partner Network (APN) Blog
Category: AWS Glue
Double Your Speed-to-Data Quality with Elula’s Data Quality Checker and AWS Glue
Data quality is traditionally inconsistent and slow across teams and functions. Custom solutions are purpose-built, of varying quality, and rarely maintained. Elula developed a proprietary Data Quality Checker (DQC) powered by AWS Glue, and an Automated Data Assessment (ADA) user interface to accelerate the identification and prioritization of issues. Elula is an AWS Partner and leading provider of artificial intelligence (AI) software to the financial services sector.
Dremio Cloud is a Lakehouse Platform on AWS That Democratizes Data
Dremio Cloud is a cloud lakehouse platform on AWS that democratizes data and provides self-service access to data consumers by connecting business intelligence users and analysts directly to data on Amazon S3 and beyond. Learn about the benefits of Dremio Cloud, how to set it up, and start using Dremio’s high-performance lakehouse platform in less than 15 minutes. Review Dremio Cloud’s key features and explore a getting started tutorial with sample datasets.
How WANdisco LiveData Migrator Can Migrate Apache Hive Metastore to AWS Glue Data Catalog
Big datasets have traditionally been locked on-premises because of data gravity, making it difficult to leverage cloud-native, serverless, and cutting-edge technologies provided by AWS and its community of partners. Modernizing an on-premises analytics platform takes time, effort, and careful planning. Explore the challenges of migrating large, complex, actively-used structured datasets to AWS and how the combination of WANdisco LiveData Migrator, Amazon S3, and AWS Glue Data Catalog overcome those challenges.
Integrating and Analyzing ESG Data on AWS Using CSRHub and Amazon QuickSight
Environmental, social, and governance (ESG) factors are increasingly important for financial institutions as they look to assess portfolio risk, meet investment mandates, align with customer values, and report on the sustainability of their portfolios. Working closely with CSRHub, a data provider on AWS Data Exchange, learn how AWS has produced a demonstration to illustrate how customers can analyze company-level ESG scoring data with Amazon QuickSight.
Unify On-Premises and Cloud-Hosted Data Assets Using Informatica Enterprise Data Catalog
Systems are growing more complex, cloud applications are growing in adoption, and cloud data lakes are being increasingly deployed. At the same time, organizations need to implement data cataloging solutions to provide data governance, data analytics, or pure metadata management. Informatica Enterprise Data Catalog (EDC) scans and catalogs an enterprise’s data assets, whether hosted on the cloud or stored on-premises.
Change Data Capture from On-Premises SQL Server to Amazon Redshift Target
Change Data Capture (CDC) is the technique of systematically tracking incremental change in data at the source, and subsequently applying these changes at the target to maintain synchronization. You can implement CDC in diverse scenarios using a variety of tools and technologies. Here, Cognizant uses a hypothetical retailer with a customer loyalty program to demonstrate how CDC can synchronize incremental changes in customer activity with the main body of data already stored about a customer.
How to Use AWS Glue to Prepare and Load Amazon S3 Data for Analysis by Teradata Vantage
Customers want to use Teradata Vantage to analyze the data they have stored in Amazon S3, but the AWS service that prepares and loads data stored in S3 for analytics, AWS Glue, does not natively support Teradata Vantage. To use AWS Glue to prep and load data for analysis by Teradata Vantage, you need to rely on AWS Glue custom database connectors. Follow step-by-step instructions and learn how to set up Vantage and AWS Glue to perform Teradata-level analytics on the data you have stored in Amazon S3.