AWS Public Sector Blog

Tag: open data

close up of laptop showing tracking of sharks via OCEARCH along the US Carolina coastline

Assessing the ocean’s health by monitoring shark populations

OCEARCH is a data-centric organization built to help scientists collect previously unattainable data about the ocean. Their mission is to accelerate the ocean’s return to balance and abundance, through innovation in scientific research, education, outreach, and policy, using unique collaborations of individuals and organizations in the US and abroad. As part of the Amazon Sustainability Data Initiative (ASDI), we invited Fernanda Ubatuba, president and COO at OCEARCH, to share how her organization is making strides in helping ocean conservation and how AWS is supporting her mission.

How the cloud is helping remove barriers to addressing climate change

What if we were to democratize access to data and compute so that anyone, anywhere in the world could contribute to climate science? The Amazon Sustainability Data Initiative (ASDI) seeks to accelerate sustainability research and innovation by minimizing the cost and time required to acquire and analyze large sustainability datasets. ASDI supports innovators and researchers with the data, tools, and technical expertise they need to advance sustainability initiatives. ASDI is committed to making climate-relevant data easier to access and analyze. ASDI’s growing data catalog comprises petabytes of open data.

aerial view of green field and fog

Now available: CMIP6 dataset to foster climate innovation and study the impact of future climate conditions

Today, Amazon announced that it is now hosting petabytes of data from the largest and most updated climate simulation dataset in the world. Through two cloud grants from the Amazon Sustainability Data Initiative (ASDI) to the Earth System Grid Federation (ESGF), Amazon is enabling climate researchers worldwide to access and analyze the dataset used for the United Nation’s Intergovernmental Panel on Climate Change’s Sixth Assessment Report (IPCC-AR6) on the AWS Cloud. The report—scheduled to be published in May 2022—provides policymakers worldwide with the latest assessment of the scientific basis of climate change, its impacts and future risks, and options for adaptation and mitigation. The climate simulation dataset, also known as the Coupled Model Intercomparison Project Phase 6 (CMIP6) data archive, traditionally hosted and distributed through the ESGF servers, aggregates the climate models created across approximately 30 working groups and 1,000 researchers working on IPCC-AR6.

aerial street map Singapore

NUS Urban Analytics Lab scales research globally with AWS

The Urban Analytics Lab at the National University of Singapore (NUS) spearheads research in geospatial data analysis and 3D city modelling. The lab’s work underpins the development of smart cities and provides scientists, architects, urban planners, and real estate developers with data insights. These insights help parties make informed decisions about projects ranging from energy modelling to urban farming. To meet rising global demand for its data analytics and planning tools, Urban Analytics Lab turned to Amazon Web Services (AWS).

aerial view of forest and ocean

SARS-CoV-2 viral genomes, storm surge forecasts, cloud-free satellite imagery: The latest open data on AWS

The AWS Open Data Sponsorship Program makes high-value, cloud-optimized datasets publicly available on Amazon Web Services (AWS). We work with data providers to democratize access to data by making it available for analysis on AWS; develop new cloud-native techniques, formats, and tools that lower the cost of working with data; and encourage the development of communities that benefit from access to shared datasets. Our full list of publicly available datasets are on the Registry of Open Data on AWS. This quarter, we released 28 new datasets including data on SARS-CoV-2 viral genomes, storm surge forecasts, and US census data. Check out some highlights.

SD 1020 approaches Point Bluff, New Zealand in stormy conditions after finishing the first Saildrone Antarctic Circumnavigation, sailing 22,000 kilometers around the Southern Ocean in 196 days.

Collecting data in remote oceans with a cost-efficient, scalable, and flexible infrastructure

Saildrone builds and operates a fleet of unmanned surface vehicles (USVs) designed to collect high-resolution oceanographic and atmospheric data in remote oceans. Known as saildrones, each vehicle can stay at sea for up to 12 months, transmitting real-time data via satellite. The data collected is used to inform climate models and extreme weather prediction, maritime domain awareness, maps and charts, and sustainable management of resources. Using clean, renewable wind and solar power, saildrones provide access to the world’s oceans at a fraction of the cost of traditional ship-based methods, while drastically reducing the carbon footprint of global ocean observation.

Serverless GIS

Serverless GIS with Amazon S3, open data, and ArcGIS

If you are hosting an ArcGIS web app today, then you are probably hosting it on a Windows or Linux server using traditional web server software like IIS or Apache. With the web hosting capability of Amazon S3 you can remove the need to run these servers and the maintenance, management, and monitoring overhead that comes with it. Serverless services like Amazon S3 can scale automatically and can be as simple as copying over your website assets to get up and running in minutes. This blog focuses on web app implementations using ArcGIS API for JavaScript (as other ArcGIS web apps have additional considerations).

A "cryptic pocket" inside the main protease, identified during Folding@home simulations.

Crowdsourcing a cure for COVID-19: How the cloud and Folding@home are accelerating research and drug discovery

Today more than 200,000 volunteers around the world are helping accelerate research toward COVID-19 therapies—by walking away from their computers. That’s because of a concept called distributed computing, which allows anyone with a home computer, laptop, or virtual machine to contribute computing power to a common cause. This month, nonprofit Folding@home has started sharing one of the world’s largest public protein simulation databases as an AWS Open Data Set so that researchers around the world can easily access this data to speed up the search for therapies for COVID-19.

cloud horizon

Building cloud-based community knowledge about machine learning to predict and understand extreme weather

The National Center for Atmospheric Research (NCAR) is a federally funded research and development center sponsored by the National Science Foundation. It engages in large-scale Earth system science research projects in collaboration with the broader university community. NCAR hosts visitors from around the world, develops community models including the Community Earth System Model and the Weather Research and Forecasting Model, and maintains supercomputers, observational systems, and aircraft to support further study on the how the planet works. As part of the Amazon Sustainability Data Initiative, we invited Dr. David John Gagne, machine learning (ML) scientist at NCAR, to share how open data and machine learning on AWS are impacting the way we predict and understand extreme weather.

coronavirus

Taking COVID in STRIDES: The National Center for Biotechnology Information makes coronavirus genomic data available on AWS

AWS and the National Institutes of Health’s (NIH) National Center for Biotechnology Information (NCBI) announced the creation of the Coronavirus Genome Sequence Dataset to support COVID-19 research. The dataset is hosted by the AWS Open Data Sponsorship Program and accessible on the Registry of Open Data on AWS, providing researchers quick and easy access to coronavirus sequence data at no cost for use in their COVID-19 research.