Showing posts with label Dust. Show all posts
Showing posts with label Dust. Show all posts

Monday, December 15, 2025

Creating and Assessing an Unconventional Global Database of Dust Storms Utilizing Generative AI

In the past we have written about how one can use social media to monitor dust storms along with how multi-modal large language models (MLLMs) can be used to analyze images. At the recent American Geophysical Union (AGU) Fall Meeting we (Sage Keidel, Stuart Evans and myself) brought these two strands of research together in a poster entitled "Creating and Assessing an Unconventional Global Database of Dust Storms Utilizing Generative AI."

In this work we showcase how MLLMs are providing new opportunities and accessible methods for information extraction from imagery data using geo-located images from Flickr which have a dust keyword tag associated with it from multiple languages (e.g., Arabic, English, Spanish).  We run these images through ChatGPT, which classifies them as dust storms or not and compare this classification with human classifed images. If this sounds of interest, below you can read the abstract, see the poster along with a selection of images that have been labeled as as dust storm or not and ChatGPTs confidence in its classification. While the dust storm database itself can be found here

Abstract:

Complete observations of dust events are difficult, as dust’s spatial and temporal variability means satellites may miss dust due to overpass time or cloud coverage, while ground stations may miss dust due to not being in the plume. As a result, an unknown number of dust events go unrecorded in traditional datasets. Dust’s importance both for atmospheric processes and as a health and travel hazard makes detecting dust events whenever possible important, and in particular, studies of the health impacts of dust are limited by detailed exposure information. 

In recent years, social media platforms have emerged as a valuable source of unconventional data to study events such as earthquakes and flooding around the world. However, one challenge with respect to using such data is classifying and labeling it (i.e., is it a dust storm or not?). While it is relatively simple to classify textural data through natural language processing, it is not the case with imagery data. Traditionally, classifying imagery data was a complex computer vision task. However, recent advancements in generative artificial intelligence (AI) especially multi-modal large language models (MLLMs) are opening up new opportunities and offering accessible methods for information extraction from imagery data. Therefore, in this study we collected geotagged Flickr images referencing dust from around the globe from multiple languages (e.g., English, Spanish, Arabic) and use generative AI (i.e., ChatGPT) to classify the images as dust storms or not. Furthermore, we compare a sample of these classified images from ChatGPT with human classified images to assess its accuracy in classification. Our results suggest that ChatGPT can relatively accurately detect dust storms from Flickr images and thus helps us create an unconventional global database of dust storm events that might otherwise go unobserved from more traditional datasets.



Workflow

Poster

Dust storm database (click here to go to it)

Full Referece: 
Keidel, S., Evans S. and Crooks, A.T. (2025), Creating and Assessing an Unconventional Global Database of Dust Storms Utilizing Generative AI, American Geophysical Union (AGU) Fall Meeting, 15th–19th December, New Orleans, LA. (pdf of poster).

Tuesday, May 13, 2025

Crowdsourcing dust storms utilizing social media data

In the past we have explored how social media can be used to delineate earthquakes, study human-wildlife interactions, understand urban morphology, urban smells or  locating wildfires among many other things. 

Keeping with the last topic (i.e., locating things), in a new paper published in GeoJournal entitled "Crowdsourcing dust storms in the United States utilizing social media data," Stuart EvansFestus Adegbola and myself explore how we can use X (formerly Twitter) and Flickr  to source observations of windblown dust. 

As such the paper demonstrates how social media data can act as supplementary source for dust events monitoring and captures the seasonal trends of such events. Furthermore, the paper highlights the potential of using crowdsourced data for the often overlooked field of dust monitoring that has substantial health and economic impacts. 

If this sounds of interest, below we provide the abstract to the paper along with some figures which showcase our methodology and comparison with National Weather Service dust advisories and VIIRS satellite data. At the bottom of the post, you can find the full reference to the paper along with a link to it. 

Abstract: 

Dust storms and other dust events are natural phenomena characterized by strong winds carrying large amounts of fine particles which have significant environmental and human impacts. However, capturing the occurrence of such phenomena is a challenge. Previous studies have limitations due to available data, especially regarding short-lived, intense dust storms and events that are not captured by observing stations and satellite instruments. In recent years, the advent of social media platforms has provided a unique opportunity to access vast amounts of crowdsourced data. This paper explores the utilization of Flickr and X (Twitter) data to study dust event occurrences within the United States and their correlation with National Weather Service (NWS) advisories. The work ascertains the reliability of using crowdsourced data as a supplementary source for dust events monitoring. Our analysis of Flickr and X indicates that the Southwest region is most susceptible to dust events, with Arizona leading in the highest number of occurrences. On the other hand, the Great Plains show a scarcity of crowdsourced data related to dust events, which can be attributed to the sparsely populated nature of the region. Furthermore, seasonal analysis reveals that dust events are prevalent during the Summer months followed by Spring. These results are consistent with previous traditional studies that did not use social media of dust occurrences in the U.S., and Flickr-identified images of dust events show substantial co-occurrence with regions of NWS dust warnings. This paper highlights the potential of using crowdsourced data for the often overlooked field of dust monitoring that has substantial health and economic impacts.
Keywords: Dust storms, Crowdsourcing, Social media, Weather. 

 

Flowchart of our workflow
Selected posts retrieved from X showing active dust events.

Selected images retrieved from Flickr showing active dust events.

Map showing the distribution of flickr-identified dust event occurrences, X-identified dust event occurrences, National Weather Service dust advisories, including dust storm (DS) warnings and blowing dust (DU) advisories.

Seasonal cycle of dust events using social media metadata, the National Weather Service advisories, and the VIIRS satellite data.

Examples of social media identified dust events and satellite observations for the same day. Brown shaded pixels indicate locations Suomi-VIIRS observed dust particles. Any VTEC warnings issued by NWS for the location are shown after the date of each dust event, with HWW and DSW indicating High Wind Warning and Dust Storm Warning, respectively.

Full Referece: 
Adegbola, F., Crooks, A.T. and Evans, S.M. (2025). Crowdsourcing dust storms in the United States utilizing social media data. GeoJournal, 90(3), pp.1-18. Available at https://doi.org/10.1007/s10708-025-11359-9 (pdf)

Saturday, December 14, 2024

AGU

This past week we attended the American Geophysical Union (AGU) Fall Meeting in Washington DC. At the AGU we presented two abstracts. 

The first follows on our work with respect to using synthetic populations within agent-based models. This work was with Na Jiang, Fuzhen Yin and Boyu Wang and entitled "A Framework for Populating Urban Digital Twins with Agents." Or more specially why digital twins need agents. Below you can see our abstract and a couple of figures showing our synthetic population workflow and how we integrate these into agent-based models.  

Abstract:

Over the last few years, considerable efforts have been placed in creating digital twins from diverse fields ranging from engineering to urban planning and many things in-between. These digital twins have benefited from the growth and availability of computational power and data. For example, in urban planning the growth of computational resources and the explosion of spatial data sources(e.g. remote sensing) has lead to the creation and widespread adoption of detailed virtual urban environments or urban digital twins. However, we would argue that many of such works emphasize only the physical infrastructure or the built environment of the city instead of considering the key actors of urban systems: the people who live in them. In this work we aim to remedy this by introducing a framework that utilizes agent-based modeling to add humans to such urban digital twins. This framework consists of two components: 1)synthetic populations generated with census data; and 2) pipeline of using the population datasets for agent-based modeling applications within the urban digital twins domain. To demonstrate the utility of this framework, we have representative applications that showcase how digital twins can be created to study various urban phenomena (e.g., evacuation scenarios, traffic congestion and disease transmission). By doing so, we believe this framework will benefit researchers wishing to build urban digital twins and to explore complex urban issues with realistic populations. 


Workflow of utilizing synthetic populations within agent-based models.
Examples of agent-based models utilizing our synthetic popuation.

In a different presentation, we return to how one can use social media to monitor the world around us, in this case dust storms. This work entitled "Mining unconventional data sources: creating a social media-based catalog of dust events in the Western US" is collaboration with Stuart Evans and Festus Adegbola. Generally speaking we explore how social media has the potential for a new unconventional source of observations of windblown dust. If this sounds of interest, below you can read the abstract to the paper and see the visual overlap between social media posts about dust events and official National Weather Service (NWS) dust storm warning coverage. 

Abstract 

Complete observations of dust events are difficult, as dust’s spatial and temporal variability means satellites may miss dust due to overpass time or cloud coverage, while ground stations may miss dust due to not being in the plume. As a result, an unknown number of dust events go unrecorded in traditional datasets. Dust’s importance both for atmospheric processes and as a health and travel hazard makes detecting dust events whenever possible important, and in particular, studies of the health impacts of dust are limited by detailed exposure information, i.e. where is there dust and when. In recent years, social media platforms have provided an opportunity to access vast user-generated data. This research utilizes geotagged Flickr and Twitter posts referencing dust in the western US, and compares it to traditional datasets including blowing dust reports from the National Weather Service and satellite observations from Suomi-VIIRS. Results show that this unconventional dataset broadly recreates the observed spatial and seasonal distributions of dust. Daily analysis of the locations of the social media posts creates a novel catalog of dust events in the western US that can be used for further research. While this catalog is necessarily incomplete, it nonetheless provides a complementary list of events to those detected by traditional means. Analysis of individual events in this catalog shows that social media captures many dust events that previously went undetected by traditional datasets.


References:

Crooks, A.T., Jiang, N., Yin, F. and Wang, B. (2024), A Framework for Populating Urban Digital Twins with Agents, American Geophysical Union (AGU) Fall Meeting, 9th–13th December, Washington, DC. (pdf)

Evans, S., Adegbola, F. and Crooks, A.T. (2024), Mining Unconventional Data Sources: Creating a Social Media-based Catalog of Dust Events in the Western US, American Geophysical Union (AGU) Fall Meeting, 9th–13th December, Washington, DC. (pdf)

Tuesday, December 19, 2023

Crowdsourcing Dust Storms in the United States Utilizing Flickr

In the past on this site we have written about how one can use social media to study the world around us. Often the focus has been on Twitter but that is not the only social media platform available.  Another is Flickr, and while in past posts have show how we can use this platform to explore bird sightings, wildfires and human migration we are now turning our attention to other phenomena. One of which is dust storms. Working with Festus Adegbola and Stuart  Evans we have just presented a poster at the 2023 American Geophysical Union Fall Meeting entitled "Crowdsourcing Dust Storms in the United States Utilizing Flickr"

In this research we compare Flickr images with National Weather Service  advisories and the VIIRS Deep Blue aerosol product data from the Suomi-NPP satellite. Our preliminary findings show that Flickr images of dust storms have a substantial co-occurrence with regions of NWS blowing dust advisories. If this sounds of interest, below you can read our abstract, see our workflow and the poster itself. 

Abstract

Dust storms are natural phenomena characterized by strong winds carrying large amounts of fine particles which have significant environmental and human impacts. Previous studies have limitations due to available data, especially regarding short-lived, intense dust storms that are not captured by observing stations and satellite instruments. In recent years, the advent of social media platforms has provided a unique opportunity to access a vast amount of user-generated data. This research explores the utilization of Flickr data to study dust storm occurrences within the United States and their correlation with National Weather Service (NWS) advisories. The work ascertains the reliability of using crowdsourced data as a supplementary tool for dust storm monitoring. Our analysis of Flickr metadata indicates that the Southwest is most susceptible to dust storm events, with Arizona leading in the highest number of occurrences. On the other hand, the Great Plains show a scarcity of Flickr data related to dust storms, which can be attributed to the sparsely populated nature of the region. Furthermore, seasonal analysis reveals that dust storm events are prevalent during the Summer months, specifically from June to August, followed by Spring. These results are consistent with previous studies of dust occurrence in the US, and Flickr-identified images of dust storms show substantial co-occurrence with regions of NWS blowing dust advisories. This research highlights the potential of unconventional user-generated data sources to crowdsource environmental monitoring and research.

Data collection and workflow.
Distribution of Flickr identified dust storm occurrences and NWS dust storm advisories.

Full Reference: 

Adegbola, F., Crooks, A.T. and Evans, S. (2023), Crowdsourcing Dust Storms in the United States Utilizing Flickr, American Geophysical Union (AGU) Fall Meeting, 11th – 15th December, San Francisco, CA. (abstract, poster)