AWS Cloud Operations Blog
Category: Monitoring and observability
Collecting Apache Flink metrics in the Amazon CloudWatch agent
Apache Flink is a distributed stream processing engine. You can run Flink on Amazon EMR as a YARN application. You can view Flink metrics through its web UI, but what if you want to react to them? In this blog post, I’ll show you how to use the CloudWatch agent to collect Flink metrics into […]
Use AWS CloudWatch Contributor Insights to monitor CIS AWS Foundations Benchmark controls
Contributor Insights is a feature of AWS CloudWatch that can be used to analyze log data to create time series that displays contributor data. This will help you understand who or what is impacting your system and application performance by identifying top talkers, pinpointing outliers, finding the heaviest traffic patterns, and ranking the top system […]
Introducing CloudWatch Resource Health to monitor your EC2 hosts
Today, AWS announced Amazon CloudWatch Resource Health, a fully managed solution that customers can use to automatically discover, manage, and visualize the health and performance of Amazon Elastic Compute Cloud (Amazon EC2) hosts across their applications. Resource Health provides a centralized view of your EC2 hosts by performance dimensions such as CPU or memory utilization. […]
Using VPC endpoints for AWS X-Ray
Today, AWS X-Ray announces the general availability of VPC endpoint support, which makes it possible for you to establish a private connection between your VPC and AWS X-Ray. Applications running in your VPC can now communicate with AWS X-Ray to send trace data without going through the public internet. In this post, I will show […]
Monitoring your EC2 server fleet with advanced CloudWatch agent capabilities
Customers who are running fleets of Amazon Elastic Compute Cloud (Amazon EC2) instances use advanced monitoring techniques to observe their operational performance. Capabilities like aggregated and custom dimensions help customers categorize and customize their metrics across server fleets for fast and efficient decision making. Customers need visibility not only into infrastructure metrics (like CPU and […]
How Wealthfront utilizes AWS X-Ray to analyze and debug distributed applications
This blog post was written by Harichandan Pulagam, a Data Engineer at Wealthfront In this blog post, we describe how Wealthfront used AWS X-Ray to streamline the development and operations of a distributed application. About Wealthfront Wealthfront’s mission is to build a financial system that favors people, not institutions. They strive to provide better experiences […]
Delete Amazon CloudWatch Synthetics dependent resources when you delete a CloudFormation stack
Amazon CloudWatch Synthetics allows you to monitor application endpoints more easily. It runs tests on your endpoints every minute, and alerts you if your application endpoints don’t behave as expected. These tests can be customized to check for availability, latency, transactions, broken or dead links, page load errors, load latencies for UI assets, complex wizard […]
Sending CloudFront standard logs to CloudWatch Logs for analysis
Amazon CloudFront is a fast content delivery network (CDN) service that securely delivers data, videos, applications, and APIs to customers globally with low latency, high transfer speeds, all within a developer-friendly environment. CloudFront standard logs (also known as access logs) give you visibility into requests that are made to a CloudFront distribution. The logs can […]
Amazon Managed Grafana preview updated with new capabilities
September 8, 2021: Amazon Elasticsearch Service has been renamed to Amazon OpenSearch Service. See details. Starting today, Amazon Managed Grafana is now available to all AWS customers in public preview and updated with new features and plugins. Amazon Managed Grafana manages the provisioning, setup, scaling, and maintenance of Grafana servers, eliminating the need for customers […]
Proactive monitoring of application configuration deployment using AWS AppConfig and Amazon CloudWatch
While deploying critical changes to large-scale applications, unexpected errors can render the application unavailable to end users until the changes are manually rolled back. As a best practice, many Amazon teams use AWS AppConfig to deploy application configuration changes. AWS AppConfig is a capability of AWS Systems Manager that you can use to create, manage, […]