AWS Cloud Operations Blog

Category: Management & Governance

Monitoring Amazon RDS and Amazon Aurora using Amazon Managed Grafana

Organizations running critical applications on AWS using fully managed database services such as Amazon Relational Database Service (Amazon RDS) and Amazon Aurora rely on robust monitoring to ensure that their databases are performant, and cause no service disruptions to their customers. Amazon Managed Grafana is a fully managed and secure data visualization service that you […]

How to Automate Incident Response with PagerDuty and AWS Systems Manager Incident Manager

Incident response is a core operations capability for organizations to develop, and a core element in the AWS Cloud Adoption Framework (AWS CAF). Responding to operations incidents quickly is important to minimize their impacts. Automating incident response helps you scale your capabilities, rapidly reduce the recovery time, and reduce repetitive work by your cloud operations teams. […]

Build Cloud Operations Skills Using the New Getting Started with AWS Audit Manager Training

Are you responsible for your organization’s compliance? Do you want to simplify and automate audit activities? Do you want to make sure your organization is compliant with internal control frameworks and industry standards? If you need to simplify your risk and compliance assessments while automating evidence collection in your AWS cloud environment, then getting started […]

Group Amazon CloudWatch Synthetics canaries for an aggregated view across regions

Customers frequently use CloudWatch canaries to monitor their applications that enables them to identify issues pro-actively and resolve them before they reach their end users. In today’s world with the cloud making it much simpler to expand globally and provision infrastructure across different parts of the world, customers tend to localize their infrastructure to the […]

How to manage multi-account applications with AppRegistry and Resource Access Manager

In previous posts we discussed how AWS Service Catalog AppRegistry helps you group applications and application resources within a single context. You can define applications within AppRegistry by providing a name, description, associations to AWS CloudFormation stacks, and associations to application metadata. It is common for customers to deploy applications with CloudFormation across multiple AWS […]

Using AWS Distro for OpenTelemetry and IAM Roles Anywhere on-premises to ingest metrics into Amazon Managed Service for Prometheus

Customers using Prometheus in self-hosted environments face challenges in managing a highly-available, scalable and secure Prometheus server environment, infrastructure for long-term storage, and access control. Amazon Managed Service for Prometheus, a Prometheus-compatible monitoring service for infrastructure and application metrics, solves these problems by providing a fully-managed environment which is tightly integrated with AWS Identity and […]

Centrally deploy patching operations across your AWS Organization using Systems Manager Quick Setup

Organizations managing cloud infrastructure in Amazon Web Services need effective mechanisms to manage compliance and security for their resources and applications. Previously, customers were able to scan instances daily for missing patches across all instances in their organization through the Host Management Quick Setup Configuration. Additionally, customers could implement patching using default patch baselines in […]

Moving from a single account AWS Config deployment to an Organization wide deployment

As customers become more mature in the cloud, they will start to investigate how they can utilize additional AWS services in order to meet their goals. In many cases the initial phase will involve some research and testing of the service before deploying it across their cloud environment. For customers that may need to maintain […]

How CloudWatch cross-account observability helps JPMorgan Chase improve Federated Data Lake Monitoring

AWS best practices guide customers to deploy their applications across multiple AWS accounts to establish security and billing boundary between teams and to reduce the impact of operational events. As enterprises grow and scale with tons of resources, customers often need a unified observability experience to help them search, visualize, and analyze their cross-account telemetry […]

Top 10 AWS Cloud Operations and Migrations Blog posts of 2022

With 2022 behind us, we want to take the opportunity to highlight our readers and the top blog posts from 2022. A big thank you to all our readers but also our authors who continue to work on delighting our customers with their blog posts. #1 Announcing AWS CloudTrail Lake – a managed audit and […]