AWS Cloud Operations Blog
Category: Management & Governance
The Importance of Key Performance Indicators (KPIs) for Large-Scale Cloud Migrations
Key performance indicators (KPIs) are quantifiable measurements that help you understand how well you’re performing in specific areas. For example, from an incident management perspective, you may measure the mean time to recovery to understand how long it takes to recover following an incident. Large-scale enterprise migration programs (such as vacating a data center or […]
Know Before You Go – AWS re:Invent 2022 Centralized Operations Management
Whatever stage you are at in your process of moving to or operating in the cloud, AWS offers centralized operations management services that you can use to manage and operate your applications on AWS, on-premises, in hybrid environments, or at the edge. Operate your applications from a central location with automation, integrations, built-in best practices, […]
Know Before You Go – AWS re:Invent 2022 Compliance & Auditing
As organizations scale by moving more of their workloads to the cloud, they are looking to manage their cloud operations securely and to be prepared for compliance and auditing. AWS Cloud Operations aims to improve the compliance and auditing process in the cloud through best-in-class services backed by the scale and security of AWS infrastructure, per […]
Know Before You Go – AWS re:Invent 2022 Monitoring & Observability
Whether you are building out applications in the cloud, modernizing your environment, or migrating workloads, observability is vital to your success. Monitoring and observability provide operational visibility and insight into your workloads and are crucial to operational excellence. AWS Observability will be at re:Invent 2022 to share how you can leverage observability for your organization. […]
Automate AIOps for your microservices in AWS using Amazon DevOps Guru and AWS Systems Manager Incident Manager
Artificial intelligence operations (AIOps) is the process of using machine learning techniques to solve operational problems. The goal of AIOps is to reduce human intervention in IT operations processes. By using advanced machine learning techniques, you can reduce operational incidents and increase service quality, and AIOps can help you predict incidents before they happen. Amazon […]
How to develop an Observability strategy – Part 2
Your observability strategy starts with your business. “Observability” describes how well you can understand what’s happening in a system. Developing an observability strategy isn’t a one-time effort. It’s a continuous improvement effort that occurs throughout the lifecycle of your workloads. It enables your teams to determine whether or not the workloads they design and run […]
Build Cloud Operations skills using the new Getting Started with AWS CloudTrail Training
Are you an organization that needs help with configuration, compliance, and auditing? Do you need to gain visibility into your organization’s account activity across AWS infrastructure? AWS CloudTrail records actions taken by users, roles, or even an AWS service. CloudTrail records actions taken in the AWS Management Console, AWS Command Line Interface (AWS CLI), AWS […]
Build Cloud Operations Skills Using the New Getting Started with AWS Config Training
Are you responsible for your company’s compliance? Do you want to make sure that your AWS resources are aligned to your company’s desired configurations? Do you want to know how to automate the remediation of noncompliant resources? Do you see an opportunity for your organization to automate its continuous compliance at scale? If you need to understand […]
Cost Optimization recommendations for AWS Config
In this post, we’ll walk you through best practices and recommendations for optimizing AWS Config costs. The post also provides technical guidance on reviewing the rules and the recorder, deleting or removing rules that aren’t needed, and then editing the Config settings, specifically the “Resource types to record”, to […]
Monitoring the availability and health of on-premises application using AWS CloudWatch Synthetics
Amazon CloudWatch is a monitoring and observability service that provides you with data and actionable insights to monitor your applications, respond to system-wide performance changes, and optimize resource utilization. You can use various CloudWatch capabilities to monitor the health of an application that is available over the internet or resides within an Amazon Virtual Private Cloud (Amazon VPC) […]