AWS Storage Blog
Maximizing price performance for big data workloads using Amazon EBS
Since the emergence of big data over a decade ago, Hadoop – an open-source framework that is used to efficiently store and process large datasets – has been crucial in storing, analyzing, and reducing that data to provide value for enterprises. Hadoop lets you store structured, partially structured, or unstructured data of any kind across […]
Accelerating GPT large language model training with AWS services
GPT, or Generative Pre-trained Transformer, is a language model that has shown remarkable progress in various vertical industries. This technology has been used to generate human-like text in fields such as finance, healthcare, legal, marketing, and many others. In finance, GPT is being used to analyze financial data, generate reports, and assist with decision-making. In […]
How Goldman Sachs leverages AWS PrivateLink for Amazon S3
As a multinational investment bank and financial services company, Goldman Sachs (GS) stores diverse datasets at scale that must always be accessible whilst remaining secure and compliant with regulations and requirements. As a part of its process, Goldman Sachs leverages Amazon Virtual Private Clouds (VPC) to provide secure environments for deployment of resources within AWS, […]
Using AWS Systems Manager to upgrade from CloudEndure Disaster Recovery to AWS Elastic Disaster Recovery
AWS Elastic Disaster Recovery (DRS) is the recommended service for disaster recovery to AWS. It’s the next generation of CloudEndure Disaster Recovery (CEDR), as CloudEndure Disaster Recovery technology was used to build Elastic Disaster Recovery. Now you can upgrade replicating source servers from CloudEndure Disaster Recovery to Elastic Disaster Recovery. This is accomplished via the CEDR Server […]
How Canva saves over $3 million annually in Amazon S3 costs
Canva is an online design tool that empowers users worldwide to design, edit, and publish anything they can dream up. Canva runs most of its production workloads on AWS, using several core services, including Amazon S3, Amazon ECS, Amazon RDS, and Amazon DynamoDB. Running on AWS has helped Canva move fast and keep up with […]
Improve application resiliency with Amazon EBS volume metrics and AWS Fault Injection Simulator
When business build applications, it is crucial to monitor those applications and take measures if and when needed to avoid downtime, and potentially, revenue loss. Choosing the correct metrics to monitor and setting up alarms as needed is table stakes for customers to achieve their application resiliency and availability goals. Amazon Elastic Block Store (Amazon […]
Automating Amazon CloudWatch dashboard creation for Amazon EBS volume KPIs
Enterprises can benefit significantly from optimizing block storage performance in the cloud. One of the primary benefits is having faster and more reliable data access to support critical business operations and real-time applications by achieving reduced latency and higher throughput. Another benefit is realizing cost savings by increasing storage operational efficiency to reduce the need […]
Simplifying operations for VMware workloads using AWS Backup and VMware Cloud on AWS
Customers use VMware to power their business critical applications on premises. As they adopt cloud, they leverage VMware Cloud on AWS as a vehicle to migrate without refactoring their applications. Customers will still need to protect their application data, provide disaster recovery from events, and migrate data if needed. For a large-scale migration with limited […]
Migrating mixed file sizes with the snow-transfer-tool on AWS Snowball Edge devices
When moving your applications and business infrastructure to AWS, it is likely you will need to migrate your existing data as well. This data often comes from file share environments and contains a variety of file sizes. If the data contains more than a single digit percentage of files under 1 MB, your migration performance […]
Creating an ETL pipeline trigger for existing AWS DataSync tasks
Organizations look for ways to leverage the compute power of the cloud to analyze their data and produce reports to help drive business decisions. They want to load their data sets into extract-transform-load (ETL) pipelines for data processing. Once the data is processed, business decision makers at these organizations rely on accurate report generation to […]