AWS Architecture Blog

Category: Amazon Simple Storage Service (S3)

Figure 1. Step Functions Express workflow solution

Running a Cost-effective NLP Pipeline on Serverless Infrastructure at Scale

Amenity Analytics develops enterprise natural language processing (NLP) platforms for the finance, insurance, and media industries that extract critical insights from mountains of documents. We provide a scalable way for businesses to get a human-level understanding of information from text. In this blog post, we will show how Amenity Analytics improved the continuous integration (CI) […]

Figure 1. Architecture for batch inference at scale with Amazon SageMaker

Batch Inference at Scale with Amazon SageMaker

Running machine learning (ML) inference on large datasets is a challenge faced by many companies. There are several approaches and architecture patterns to help you tackle this problem. But no single solution may deliver the desired results for efficiency and cost effectiveness. In this blog post, we will outline a few factors that can help […]

Figure 1. App2Container scaling architecture overview

Migrate your Applications to Containers at Scale

AWS App2Container is a command line tool that you can install on a server to automate the containerization of applications. This simplifies the process of migrating a single server to containers. But if you have a fleet of servers, the process of migrating all of them could be quite time-consuming. In this situation, you can […]

Figure 1. Audit Surveillance data lake architecture diagram

How Parametric Built Audit Surveillance using AWS Data Lake Architecture

Parametric Portfolio Associates (Parametric), a wholly owned subsidiary of Morgan Stanley, is a registered investment adviser. Parametric provides investment advisory services to individual and institutional investors around the world. Parametric manages over 100,000 client portfolios with assets under management exceeding $400B (as of 9/30/21). As a registered investment adviser, Parametric is subject to numerous regulatory […]

Overview of services that integrate with CloudWatch and Trusted Advisor for monitoring metrics

Optimizing your AWS Infrastructure for Sustainability, Part III: Networking

In Part I: Compute and Part II: Storage of this series, we introduced strategies to optimize the compute and storage layer of your AWS architecture for sustainability. This blog post focuses on the network layer of your AWS infrastructure and proposes concepts to optimize your network utilization. Optimizing the networking layer of your AWS infrastructure When you […]

Figure 1. River architecture diagram, depicting the flow of data from data producers through the River data ingestion service into Snowflake.

How Cimpress Built a Self-service, API-driven Data Platform Ingestion Service

Cimpress is a global company that specializes in mass customization, empowering individuals and businesses to design, personalize and customize their own products – such as packaging, signage, masks, and clothing – and to buy those products affordably. Cimpress is composed of multiple businesses that have the option to use the Cimpress data platform. To provide […]

Serverless S3 metadata search

Swiftly Search Metadata with an Amazon S3 Serverless Architecture

As you increase the number of objects in Amazon Simple Storage Service (Amazon S3), you’ll need the ability to search through them and quickly find the information you need. In this blog post, we offer you a cost-effective solution that uses a serverless architecture to search through your metadata. Using a serverless architecture helps you […]

Figure 2. AWS Storage Gateway now supports AWS PrivateLink for Amazon S3 endpoints and Amazon S3 Access Points

Connect Amazon S3 File Gateway using AWS PrivateLink for Amazon S3

AWS Storage Gateway is a set of services that provides on-premises access to virtually unlimited cloud storage. You can extend your on-premises storage capacity, and move on-premises backups and archives to the cloud. It provides low-latency access to cloud storage by caching frequently accessed data on premises, while storing data securely and durably in the […]

CloudWatch for monitoring your storage resources

Optimizing your AWS Infrastructure for Sustainability, Part II: Storage

In Part I of this series, we introduced you to strategies to optimize the compute layer of your AWS architecture for sustainability. We provided you with success criteria, metrics, and architectural patterns to help you improve resource and energy efficiency of your AWS workloads. This blog post focuses on the storage layer of your AWS infrastructure and provides […]

Figure 1. Data flow - Source to data lake target

Hybrid Cloud Architectures Using Self-hosted Apache Kafka and AWS Glue

Using analytics to gain insights from a variety of datasets is key to successful transformation. There are many options to consider to realize the full value and potential of our data in a hybrid cloud infrastructure. Common practice is to route data produced from on-premises to a central repository or data lake. Here it can […]