AWS Cloud Operations Blog

Assess Resilience at Scale by using Amazon QuickSight and Amazon Resilience Hub

AWS Resilience Hub helps you to manage and improve the resilience posture of your applications on AWS. It enables you to define your resilience goals, assess your resilience posture against those goals, and implement recommendations for improvement based on the AWS Well-Architected Framework. This benefits individual teams that want to assess their applications. However, for […]

Using Generative AI to Gain Insights into CloudWatch Logs

Have you ever been investigating a problem and opened up a log file and thought “I have no idea what I am looking at. If only I could get a summary of the data.” Observability and log data play an important role in maintaining operational excellence and ensuring the reliability of your applications and services. […]

AWS named as a Challenger in the 2024 Gartner Magic Quadrant for Observability Platforms

AWS has been named as a Challenger in the 2024 Gartner Magic Quadrant for Observability Platforms, previously known as Gartner Application Performance Monitoring (APM) and Observability Magic Quadrant. This report assesses vendors based on their Ability to Execute and Completeness of Vision. Compared to the previous year, AWS has moved up higher on the Ability […]

How Merck Automated AWS Elastic Disaster Recovery Initialization and Monitoring

Blog is guest authored by Nasia Ullas of MSD. Enhancing the resilience and productivity of manufacturing processes is essential for pharmaceutical companies to meet business continuity objectives and innovate continuously. Merck & Co., Inc., also known as MSD outside of the United States and Canada, a global bio-pharmaceutical company, mitigated resilience challenges by adopting AWS […]

Deploy AWS Systems Manager Quick Setup programmatically across your AWS Organization

AWS Systems Manager Quick Setup simplifies setting up AWS services, including Systems Manager, by automating common or recommended tasks in your AWS Organization across AWS accounts and Regions. These tasks include, creating required AWS Identity and Access Management (IAM) instance profile roles and setting up operational best practices, such as periodic patch scans and inventory […]

Automate Standard Operating Procedures (SOPs) execution with AWS Resilience Hub

AWS Resilience Hub is a central location in the AWS Management Console for you to manage and improve the resilience posture of your applications on AWS. AWS Resilience Hub enables you to define your resilience goals, assess your resilience posture against those goals, and implement recommendations for improvement based on the AWS Well-Architected Framework. AWS […]

Streamline AWS Support with AWS Chatbot and Microsoft Teams

ChatOps is a collaborative approach to operations that integrates chat platforms with automation tools and processes. It’s a way to bring together people, tools, and processes in a single chat interface to facilitate communication, collaboration, and execution of tasks within a team or organization. It is critical for an Operations team to act quickly and […]

Generate External IdP User Access Privilege Reports with AWS Audit Manager

Generate External IdP User Access Privilege Reports with AWS Audit Manager

Organizations are required to adhere to industry-specific regulations and certification programs, and a key component of this compliance is the creation of a comprehensive user privilege and access report for cloud infrastructure. Auditors rely on these reports to verify that permissions are tightly controlled at a granular level. However, the complexity of access control mechanisms, […]

Leverage Amazon Q to upgrade Lambda runtime functions

Cloud operations are at the heart of every organization. Operating in the cloud allows IT teams to focus on business outcomes, optimizing IT processes while accelerating software development and innovation. These days, it is no longer a question if your organization is moving to the cloud, but how quickly you can move with security and […]

Using AWS CloudTrail data events to audit your Amazon SNS and Amazon SQS workloads

Customers in highly regulated industries, such as Financial Services or Healthcare and Life Sciences, often need to audit every action made in environments with sensitive data. Regulations like HIPAA or FFIEC, and industry frameworks like the PCI DSS, require granular log entries that record user and administrative actions within an environment containing sensitive data, and […]