GIS and Agent-Based Modeling: 2025

Monday, December 15, 2025

Creating and Assessing an Unconventional Global Database of Dust Storms Utilizing Generative AI

In the past we have written about how one can use social media to monitor dust storms along with how multi-modal large language models (MLLMs) can be used to analyze images. At the recent American Geophysical Union (AGU) Fall Meeting we (Sage Keidel, Stuart Evans and myself) brought these two strands of research together in a poster entitled "Creating and Assessing an Unconventional Global Database of Dust Storms Utilizing Generative AI."

In this work we showcase how MLLMs are providing new opportunities and accessible methods for information extraction from imagery data using geo-located images from Flickr which have a dust keyword tag associated with it from multiple languages (e.g., Arabic, English, Spanish). We run these images through ChatGPT, which classifies them as dust storms or not and compare this classification with human classifed images. If this sounds of interest, below you can read the abstract, see the poster along with a selection of images that have been labeled as as dust storm or not and ChatGPTs confidence in its classification. While the dust storm database itself can be found here

Abstract:

Complete observations of dust events are difficult, as dust’s spatial and temporal variability means satellites may miss dust due to overpass time or cloud coverage, while ground stations may miss dust due to not being in the plume. As a result, an unknown number of dust events go unrecorded in traditional datasets. Dust’s importance both for atmospheric processes and as a health and travel hazard makes detecting dust events whenever possible important, and in particular, studies of the health impacts of dust are limited by detailed exposure information.
In recent years, social media platforms have emerged as a valuable source of unconventional data to study events such as earthquakes and flooding around the world. However, one challenge with respect to using such data is classifying and labeling it (i.e., is it a dust storm or not?). While it is relatively simple to classify textural data through natural language processing, it is not the case with imagery data. Traditionally, classifying imagery data was a complex computer vision task. However, recent advancements in generative artificial intelligence (AI) especially multi-modal large language models (MLLMs) are opening up new opportunities and offering accessible methods for information extraction from imagery data. Therefore, in this study we collected geotagged Flickr images referencing dust from around the globe from multiple languages (e.g., English, Spanish, Arabic) and use generative AI (i.e., ChatGPT) to classify the images as dust storms or not. Furthermore, we compare a sample of these classified images from ChatGPT with human classified images to assess its accuracy in classification. Our results suggest that ChatGPT can relatively accurately detect dust storms from Flickr images and thus helps us create an unconventional global database of dust storm events that might otherwise go unobserved from more traditional datasets.

Workflow

Poster

Dust storm database (click here to go to it)

Full Referece:

Keidel, S., Evans S. and Crooks, A.T. (2025), Creating and Assessing an Unconventional Global Database of Dust Storms Utilizing Generative AI, American Geophysical Union (AGU) Fall Meeting, 15th–19th December, New Orleans, LA. (pdf of poster).

Friday, December 12, 2025

Quantitative Comparison of Population Synthesis Techniques

In the past we have written a number of posts on synthetic populations, however, one thing we have not done is compare the various techniques that can be used to create them. This has now changed with a new paper entitled "Quantitative Comparison of Population Synthesis Techniques" which was recently presented at the 2025 Winter Simulation Conference.

In this paper, we (David Han, Samiul Islam, Taylor Anderson, Hamdi Kavak and myself) investigate five synthetic population generation techniques (e.g., Iterative Proportional Fitting, Conditional Probabilities, Simple Random Sampling, Hill Climbing and Simulated Annealing) in parallel to synthesize population data for different North America settings (e.g., Fairfax County, VA, USA and Metro Vancouver, BC, Canada). Our findings suggest that while iterative proportional fitting and conditional probabilities techniques perform best, it also suggests at the same time that it is important to consider the basis of choosing certain methods over others for generating synthetic populations with regard to a geographic domain.

If this sounds of interest, below you can read the abstract to the paper, see some of the figures and tables that support our discussion. While at the bottom of the post you can find the full referece and a link to the paper. Moreover, in an effort to allow for reproducible science, all code and data are available to interested readers in our GitHub repository located at https://github.com/kavak-lab/synthetic-pop-comparison.

Abstract

Synthetic populations serve as the building blocks for predictive models in many domains, including transportation, epidemiology, and public policy. Therefore, using realistic synthetic populations is essential in these domains. Given the wide range of available techniques, determining which methods are most effective can be challenging. In this study, we investigate five synthetic population generation techniques in parallel to synthesize population data for various regions in North America. Our findings indicate that iterative proportional fitting (IPF) and conditional probabilities techniques perform best in different regions, geographic scales, and with increased attributes. Furthermore, IPF has lower implementation complexity, making it an ideal technique for various population synthesis tasks. We documented the evaluation process and shared our source code to enable further research on advancing the field of modeling and simulation.

A conceptual depiction of the IPF process for population synthesis.

Our four-step process used in this study.

Average R² values by geographic level and method (standard deviations in italics).

% Total absolute error (% TAE) comparison by attribute for Fairfax County.

Full Referece:

Han, D., Islam, S., Anderson, T., Crooks, A.T. and Kavak, H. (2025), Quantitative Comparison of Population Synthesis Techniques, in Azar, E., Djanatliev, A., Harper, A., Kogler, C., Ramamohan, V., Anagnostou, A. and Taylor, S.J.E. (eds.), Proceedings of the 2025 Winter Simulation Conference, Seattle, WA, IEEE. pp. 151-162. (pdf)

Friday, November 28, 2025

Integration of Community Level Data into Mathematical Models

In the past we have posted about how we can utilize data and models to explore pandemics and peoples reactions to them. And while interest in the COVID might of waned, there will be future pandemics.

To this end, at the 53rd Annual Meeting of NAPCRG we (Laurene Tumiel Berhalter, Sanchit Goel, Dawn Vanderkooi, Bruce Pitman, Yinyin Ye, Jennifer Surtees and myself) had a poster entitled "Integration of Community Level Data into Mathematical Models to Predict Future Public Health Emergencies." The objective of the poster is to showcase how one can integrate 211 data into models to predict future public health emergencies. If this sounds of interest, below you can see the poster and at the bottom of the post you can access the abstract.

Full Reference:

Tumiel, L.M., Goel, S., Vanderkooi, D., Pitman E.B., Crooks A.T., Ye, Y. and Surtees, J. (2025), Integration of Community Level Data into Mathematical Models to Predict Future Public Health Emergencies, North American Primary Care Research Group (NAPCRG) 53rd Annual Meeting, 21st-25th November, Atlanta, GA (pdf).

Saturday, November 08, 2025

New Paper: Modeling Wildfire Evacuation with Embedded Fuzzy Cognitive Maps

While we have explored disasters in the past through agent-based models and other computational social science approaches, one area we have not explored is how one can use agent-based models to explore evacuations durring a wild fire event. This has now changed with a new paper with Zhongyu Zhou and myself entitled "Modeling Wildfire Evacuation with Embedded Fuzzy Cognitive Maps: An Agent-Based Simulation of Emotion and Social Contagion" which was recently presented at the 2025 International Conference of the Computational Social Science Society of the Americas (CSSSA).

In the paper we present an agent-based model combined with an embedded fuzzy cognitive map (FCM) to simulate residents’ evacuation behavior during a wildfire event. If this sounds of interest, below we provide the abstract to the paper along with some of the figures that showcase the model logic and some of its results. A detailed ODD, the model and the data needed to run the model can be found at: https://github.com/ozzyzhou99/LA-Wildfire-Model/. Finally, at the bottom of the post you can find the full referece to the paper and a link to it.

Abstract:

Wildfires are becoming increasingly dangerous, especially in densely populated fire-prone areas like Los Angeles. People’s evacuation decisions during wildfire events are influenced by many factors, including emotions such as fear or panic, which often affect people’s choices to evacuate. Traditional evacuation models often assume that individuals behave rationally. As a result, these models tend to overlook the influence of emotional factors on evacuation behavior. To address this issue, this study develops an agent-based model (ABM) combined with an embedded fuzzy cognitive map (FCM) to simulate residents’ evacuation behavior during a wildfire event. The model covers two types of agents: evacuees and rescuers. It focuses on how emotions change over time and how they spread among people. While we also expect to observe how these emotional changes will affect evacuation decisions. This research also considers differences between different income groups to explore whether low-income residents are more likely to panic. Results from the model show that agents with different emotions behave differently during the evacuation process. Emotional changes clearly affect how agents choose routes and whether they can respond quickly. In addition, the results suggest that income level affects emotional responses, and low-income groups are more likely to feel fear. This study highlights the value of using ABM and FCM together to better understand evacuation behavior and provides a new idea for developing fairer and more effective disaster response plans.
Keywords: Agent-Based Modeling, Emotional decision-making, GIS, Fuzzy Cognitive Map, Wildfire Evacuation.

Data used in the setting up the model experiment. (A) is household income data, (B) is location of previously affected houses, and (C) is evacuation road data.

Agent-level embedded FCM loop with social contagion.

Evacuees’ Workflow (A), Rescuers” Workflow (B).

Box plots of average emotions for three groups of experiments (50 repetitions each). From left to right, the number of people in each income group increases progres- sively. Low income (LI), middle income (MI), and high income (HI).

Full Referece

Zhou, Z. and Crooks, A.T. (2025), Modeling Wildfire Evacuation with Embedded Fuzzy Cognitive Maps:An Agent-Based Simulation of Emotion and Social Contagion, Proceedings of the 2025 International Conference of the Computational Social Science Society of the Americas, Santa Fe, NM. (pdf)

Thursday, November 06, 2025

HD-GEN: A Software System for Large-Scale Human Mobility Data Generation Based on Patterns of Life

Human mobility datasets are essential for investigating human behavior, mobility patterns, and traffic dynamics. In the past we have written about how one can use agent-based models to generate patterns of life trajectories datasets. Building on this work at the ACM SIGSPATIAL 2025 conference, we (Hossein Amiri, Richard Yang, Shiyang Ruan, Joon-Seok Kim, Hamdi Kavak, Andrew Crooks, Dieter Pfoser, Carola Wenk and Andreas Züfle) had a paper entitled "HD-GEN: A Software System for Large-Scale Human Mobility Data Generation Based on Patterns of Life"

In this paper, we extend our previous work by introducing a software system that provides a new suite of tools built on top of the Patterns of Life simulation framework. Specifically this work consolidates our contributions into a unified data generation pipeline that includes:

additional discussion of the motivation and applications of large-scale simulated trajectory data,
detailed instructions on running the simulation and generating datasets,
extended analysis of the shared dataset, and
an integrated GitHub repository

The proposed system enables large-scale synthetic dataset generation, either by statistically replicating real-world data or by creating datasets with user-defined properties. If this sounds of interest, below you can read the abstract to the paper, the poster that accompanies it and we have also provided detailed instructions on how to reproduce the generated datasets, and made the code and data available at https://github.com/onspatial/large-scale-dataset-generator.

Abstract

Understanding individual human mobility is critical for a wide range of applications. Real-world trajectory datasets provide valuable insights into actual movement behaviors but are often constrained by data sparsity and participant bias. Synthetic data, by contrast, offer scalability and flexibility but frequently lack realism. To address this gap, we introduce a comprehensive software pipeline for generating, calibrating, and processing large-scale human mobility datasets that integrate the realism of empirical data with the control and extensibility of Patterns-of-Life simulations. Our system consists of three integrated components. First, a genetic algorithm–based calibration module fine-tunes simulation parameters to align with real-world mobility characteristics, such as daily trip counts and radius of gyration, enabling realistic behavioral modeling. Second, a data generation engine constructs geographically grounded simulations using OpenStreetMap data to produce diverse mobility logs. Third, a data processing suite transforms raw simulation logs into structured formats suitable for downstream applications, including model training and benchmarking.
Keywords: GeoLife, Patterns of Life, Simulation, Realistic Trajectory Datasets

Dataset creation phases with HD-GEN software.

Full Reference:

Hossein, A., Yang, R., Ruan, S., Kim, J-S., Kavak, H., Crooks, A.T., Pfoser, D., Wenk, C. and Züfle, A., (2025). HDGEN: A Software System for Large-Scale Human Mobility Data Generation Based on Patterns of Life. In The 33rd ACM International Conference on Advances in Geographic Information Systems (SIGSPATIAL ’25), November 3–6, 2025, Minneapolis, MN. pp. 407-410. (pdf) (poster)

Thursday, October 09, 2025

Call for Papers: Geosimulation and Its Emerging Directions with AI

As part of the GeoAI and Deep Learning Symposium at the 2026 AAG Annual Meeting in San Francisco, California we have a call for papers for sessions entitled "Geosimulation and Its Emerging Directions with AI"

Call for Papers:

Simulating past, present, and future events can empower humans to understand the composition and interactions in complex systems and explain their emergence and evolution from bottom up. In practice, geosimulations constitute a powerful tool in engaging different stakeholders, exploring what-if scenarios, and evaluating alternative policy outcomes.

We invite interdisciplinary works for the exploration and understanding of complex social and environmental processes by means of computer simulation. We focus on all aspects of simulation and agent societies, including multi-agent systems, agent-based modeling, microsimulation, artificial intelligence (AI) agents, and the integration of Generative AI with simulation.

As GenAI is impacting all aspects of our lives, we are wondering how it will impact geospatial simulations. How do multimodal large language models (MLLMs) help with agent-decision making in the form of generating agent-personas or scheduling agent activities? Can MLLMs reduce coding barriers for beginners? Will GenAI lead to a new generation of modeling toolkits? What are the challenges brought by MLLMs in model design, validation, and computing costs?

We welcome a wide range of studies exploring simulation theories, data, methodologies, and frameworks. We are also interested in case studies applying geosimulations to address real-world challenges. Potential topic areas include, but are not limited to:

Geosimulation Models and Applications
Conceptual Geosimulation Models
General-Purpose Geosimulation Framework
AI and Geosimulation
Agents’ Behaviors, Decision-making and AI Agents
Data Generation Framework
Validation and Verification for Geosimulation
Digital Twins
Microsimulation
Multi-agent Systems

If you are interested, please email your title and 250-word abstract to Fuzhen Yin (fyin@uccs.edu) and Jeon-Young Kang (geokang@khu.ac.kr) by October 30th.

Chairs:

Fuzhen Yin, University of Colorado Colorado Springs
Jeon-Young Kang, Kyung Hee University

Organizers:

Alison Heppenstall, University of Glasgow, Scotland.
Andrew Crooks, University at Buffalo, USA.
Na (Richard) Jiang, Hong Kong University of Science and Technology (Guangzhou), China
Fuzhen Yin, University of Colorado, Colorado Springs, USA.
Raja Sengupta, McGill University, Canada.
Suzana Dragicevic, Simon Fraser University, Canada.
Boyu Wang, University at Buffalo, USA.
Sarah Wise, University College London, England
Jeon-Young Kang, Kyung Hee University, South Korea
Yahya Gamal, University of Glasgow, Scotland.
Alexander Michels, University of Texas at Dallas, USA
Joon-Seok Kim, Emory University, USA

Sponsor Groups:

Friday, August 01, 2025

LLMs and ABMs

In a previous post we talked about the potential of Generative AI for urban modeling, keeping with this theme at the 11th International Conference on Computational Social Science (IC2S2), Na Jiang, Boyu Wang and myself had a poster entitled Agent-based Models with Large Language Models: Two Modeling Examples.

In this poster and extended abstract we detail how LLMs can help with many aspects of agent-based modeling development. If this sounds of interest, below you can see the abstract, the poster and the full referece and link to the extended abstract .

Abstract:

Large language models (LLMs) play an important role in AI-powered code assistants such as code completion, debugging, and documentation. Such models can be further fine-tuned on smaller amount of data for specific tasks, often with the improvement of performance compared to generic LLMs. However, such fine-tuning techniques are seldomly used in generating sophisticated agent-based models (ABMs), because they are often implemented as software that demands extra standards such as the Overview, Design concepts, and Details (ODD) protocol. This research examines how we can bridge this gap by utilizing LLMs in designing or conceptualizing, building, and running agent-based models in the form of user prompts. In this work, two models are created to demonstrate the proposed method. Specifically, Sakoda's checkerboard model of social interaction is created by LLM from explicit design and description through prompts. The other model stimulates consumer preferences and restaurant visits as designed and implemented by a LLM. These models are evaluated by human experts on their code correctness and quality for both verification and validation purposes. This work serves as a first step towards fine-tuned LLMs on existing models and documentations to create high-quality and functional ABMs based on either user prompts or standard protocols, contributing to further exploration on the future of AI-assisted geospatial simulation development.
Keywords: agent-based modeling, geospatial simulations, large language models, generative AI, coding

Full reference:

Jiang, N., Wang, B. and Crooks, A.T. (2025), Agent-based Models with Large Language Models: Two Modeling Examples, 11th International Conference on Computational Social Science (IC2S2), 21-24th July, Norrkoping, Sweden. (extended abstract pdf) (poster pdf)

Friday, July 18, 2025

Examining spatial expansion and stemming strategies of urban shrinkage

In the past we have written about how one can study urban shrinkage with a specific emphasis on Detroit from both an agent-based modeling perspective and also from analyzing newspapers through natural language processing Keeping with the theme of Detroit and urban shrinkage we (Xiaoliang Meng, Yichun Xie, Junyi Wu, Heather Khan Welsh, Shi Zeng and myself) have a new paper entitled "Examining spatial expansion and stemming strategies of urban shrinkage: evidence from Detroit, USA" which was recently published in npj Urban Sustainability.

In this paper we introduce a method for studying urban shrinkage by constructing multi-scale spatial structures based on urban network connectivity which we call gravity-networked spatial interaction zones-based spatial panel modeling or GSIZs-Spanel for short. We demonstrate this method by exploring the spatial processes and scopes of past urban shrinkage in Detroit between 2000 and 2020. If this sounds of interest, below you can read the abstract to the paper, along with the conceptual design of GSIZs-Spanel modeling framework and some of our results. While at the bottom of the post you can find the full referece and link to the paper.

Abstract:

This study introduces a new modeling paradigm called gravity-networked spatial interaction zones-based spatial panel modeling (GSIZs-Spanel). Using Detroit as a case study, this paper investigates urban shrinkage by integrating shrinkage driving factors, their regional interactions, networks of cities, spatial processes, and longitudinal dynamics. Results suggest that high minority population concentration and persistent poverty are the primary factors impacting Detroit’s inner-city shrinkage. Demographics, economics, and development practices affect shrinkage in suburbs and surrounding cities. Shrinkage spreads outwards like waves; different juxtapositions of driving factors affect shrinkage resilience; spillover effects are particularly vibrant at 25–50 GSIZs; rightsizing is a rational strategy, but it failed to work alone. Integrating spatial planning of driving factors, land uses, spillover effects, rightsizing strategy, and regional collaboration among federal, regional, and local organizations could moderate urban decline. GSIZs-Spanel, which was developed here, could be applied in any U.S. city or other global city.

The conceptual design of GSIZs-Spanel modeling framework.

Patterns of spillover effects of the Spanel models at the 5-incremental spatial clusters. (a: Spatial processes of urban shrinkage. b: Spatial patterns of vacancy severity.)

Spillover effects of the Spanel models at the 5-incremental spatial clusters

Full Reference:

Meng X., Xie, Y., Crooks, A.T., Wu J., Khan-Welsh, H. and Zen, S. (2025), Examining spatial expansion and stemming strategies of urban shrinkage: evidence from Detroit, USA, npj Urban Sustainability, 5: 52. Available at https://doi.org/10.1038/s42949-025-00245-5 (pdf)

Saturday, July 05, 2025

New Editorial: Generative AI and Urban Modeling

In the current issue of Environment and Planning B, we (Boyu Wang, Na Jiang and myself) have a new editorial entitled "Generative AI and Urban Modeling". The premise of this editorial is that Generative AI (GenAI) is impacting all aspects of our daily lives and as such has we were wondering how will it impact urban modeling?

For example, in the editorial we discuss how GenAI could speed up the overall urban modeling process. To demonstrate this we show how ChatGPT (and its built-in coding interface Canvas) can take published papers and build agent-based models from them (one being of an abstract space and another being spatially explicit).

However, while model building is time consuming task, another challenge modelers face is how to incorporate decision making within them. To this end we also discuss how large language models (LLMs) have the potential to help with agent-decision making in the form of generating agent-personas or scheduling agent activities.

We conclude the editorial with a series of questions: how will GenAI impact urban modeling? Will it open up the field to more people without the need for strong coding skills? Will we see growth in using LLMs for generating behavior? Will GenAI lead to a new generation of modeling toolkits? While these are only a short list of questions, they also raise concerns that relate back to some of the more thorny issues of urban modeling, that of verification and validation.

If this sounds of interest you can read the full editorial here.

Full Referece:

Crooks, A.T., Jiang, N. and Wang, B. (2025), Generative AI and Urban Modeling, Environment and Planning B, 52(6), 1277-1281. (pdf)

Monday, June 30, 2025

CUPUM 2025

I have just gotten back from attending the 19th International Conference on Computational Urban Planning and Urban Management (CUPUM) in London and thought I would share the two papers we presented at the conference.

The first paper was with Qingqing Chen and Linda See and was entitled "Using New Sources of Data for Urban Climate Modeling Generated through MLLMs on Street View Imagery. "As the title might suggest, this paper was about how one can leverage multi-modal large language models (MLLMs) to extract information on building height, age and function from street level photographs. We demonstrate this using street view images from Mapillary and than ask ChatGPT to estimate the building height, age and function and compare the results to authoritative data sources. If this sounds of interest, below you can see the abstract to the paper, some if the figures (i.e., the work flow and prompts) while the results can be seen in the attached paper (see the link below).

Abstract:

Urban climate and energy balance models require data on the form and function of buildings, but high resolution spatially explicit data sets are often lacking. Here we demonstrate how multi-modal large language models (MLLMs) can be used to extract information on building height, age and function from street level photographs for New York City. A workflow is presented that illustrates the approach, with initial results indicating that the building function can be identified with good accuracy while moderate accuracies were obtained for building heights and age. Suggestions for how to improve these accuracies are also provided.
KEYWORDS: Buildings, ChatGPT, Multi-modal Large Language Models (MLLMs), Mapillary, Street View Images (SVI).

An overview of research workflow.

The detailed description of multi-step prompting and an example of extracted building attributes information.

Full Reference:

Chen, Q., See, L. and Crooks, A.T. (2025), Using New Sources of Data for Urban Climate Modeling Generated through MLLMs on Street View Imagery. In Cramer-Greenbaum, S., Dennett, A., and Zhong, C (eds.), Proceedings of the 19th International Conference on Computational Urban Planning and Urban Management (CUPUM), London, UK. (pdf)

We then moved back to agent-based modeling with a paper with entitled "Enhancing Spatial Reasoning and Behavior in Urban ABMs with Large-Language Models and Geospatial Foundation Models" which brought back together Nick Malleson, Alison Heppenstall, Ed Manley and myself. In this paper we discuss the potential role of LLMs and geospatial foundation models in the context of agent-based modeling. If this sounds of interest, below you can read the abstract to the paper and find a link to it at the bottom of the post. Nick has also shared the slides of this presentation here.

Abstract:

Modeling human behavior continues to be a significant challenge for the field of agent-based modeling, and one that prohibits the development of comprehensive empirical ABMs for urban applications, such as Urban Digital Twins. However, two recent methodological advances offer the potential to transform empirical agent-based models.

Early evidence suggests that large-language models (LLMs) can be used to represent a wide range of human behaviors, with models responding in realistic ways to given prompts. Indeed there is already a flurry of activity that focusses on implementing LLM-backed agents -- i.e. agents who are controlled by LLMs. At the same time, the concept of the foundation model is also being applied in domains beyond text analysis. Of particular interest are geospatial foundation models that automatically encode spatial data in such a way as to associate different spatial objects in numerous and nuanced ways that have otherwise alluded manual classification schemes. Taken together, these two technologies offer considerable potential for a new generation of agent-based models that contain agents who can behave in response to spatial and social prompts in a way that is realistic and has so far proven impossible to replicate using manually-programmed behavioral rules.

This paper presents a discussion of the state of the art in both LLMs and geospatial foundation models in the context of their potential role in agent-based modelling. It discusses the transformational potential of these technologies and outlines the critical questions that need to be addressed before they can be used to create robust, reliable and trustworthy models for empirical policy applications that support decision-making.

KEYWORDS: Agent-based Modeling; Large language model; Geospatial foundation model; Urban Modeling.

Full Reference:

Malleson, N., Crooks, A.T., Heppenstall, A. and Manley, E. (2025), Enhancing Spatial Reasoning and Behavior in Urban ABMs with Large-Language Models and Geospatial Foundation Models. In Cramer-Greenbaum, S., Dennett, A., and Zhong, C (eds.), Proceedings of the 19th International Conference on Computational Urban Planning and Urban Management (CUPUM), London, UK. (pdf)

Saturday, June 21, 2025

Talks: ABM, AI and other Thoughts

This is a slightly different post to normal, in the sense its not really about papers but my take on agent-based modeling, urban analytics and the growth of Artificial Intelligence impacting both.

First up, while I was in Santa Fe last October for the 2024 International Conference of the Computational Social Science Society of the Americas I was interviewed by John Cordier from Epistemix for their Flux Podcast which resulted in this "From Micro-Behaviors to Macro-Patterns: Exploring Agent-Based Models with Andrew Crooks. Rather than me trying to sum it up I will just quote from the podcast episode

"In this episode of The Flux, host John Cordier sits down with Andrew Crooks ..... They dive into the world of agent-based modeling (ABM) - what it is, why it matters, and how it helps us simulate and better understand human behavior in complex systems. From simulating traffic jams to modeling social influence on vaccine uptake, Andrew shares how data, geography, and synthetic populations are revolutionizing our ability to forecast and inform decisions. They also explore the growing role of AI tools in democratizing modeling, the evolution of computational capabilities, and even ask: what if we had run a simulation before Brexit?"

If this sounds of interest, you can listen to the full podcast here.

Next up, I was asked to give a talk back in late May to give a seminar talk at the Department of Geography and Spatial Sciences (GSS) at the University at Delaware hosted by Yao Hu. The title of the talk was "Monitoring and Analyzing Cities through the Lens of Urban Analytics" In this talk I reflect what urban analytics means to me and how the field is changing. If this sounds of interest, below you can read the abstract to my talk and also see the recording. However, before ending this I would really like to thank Yao for hosting me, and the others from the GSS and the universty at large for making it a great visit and being an engaged audience.

Abstract:

For the first time in human history, more people are living in cities than rural areas and this trend is only expected to grow in the coming decades. This growth will place unprecedented challenges on cites with respect to sustainable development especially in light of climate change and increasing populations. One way to explore and understand cities is through the lens of urban analytics, a set of methods that allow us to monitor, analyze and model urban areas. This talk will explore how urban analytics has changed over time and showcase how our understanding of cities has benefited from it. I will showcase how new sources of data can be used to monitor and analyze cities and how in turn these can be integrated into models to explore various aspects of city life from pedestrian movement to urban growth. The talk will conclude with a discussion and demonstration of how artificial intelligence can be integrated into the urban analytics toolbox and what opportunities and challenges it poses.

Also in late May, Alison Heppenstall, and myself were interviewed by Dr. Andy Collins discussing as part of the Computational Social Science Society of the Americas (CSSSA) webinar series on Agent-based modeling and simulation (ABMS). To quote from CSSSA, the purpose of these webinars is that:

"Agent-based modeling and simulation (ABMS) has been applied far and wide to better understand our world. Each new application domain brings with it existing cultures of the domain's experts, including expectations and requirements. As such, it is foolhardy to expect agent-based modeling to be standardized across all domains. As practitioners, there is a desire to understand how these domain cultures differ, how they use agent-based modeling, and what the future of agent-based modeling is within those domains. To start to grapple with these grand questions, for the ABMS community, we are proposing to run a series of interviews with experts from different domains to try to map the world of agent-based modeling."

Readers, might not be surprised but we were asked to discuss ABM in the context of geography. So if you want to hear us discuss ABM and geography, you can see the talk below. It should also be noted the CSSSA has a whole host of other webinars on their YouTube Channel.

Finally, at the start of May, I was invited to give one of the keynotes at the Inaugural AI and Cities: An International Forum for Innovation and Collaboration hosted by University of Florida entitled "Artificial intelligence and Urban Analytics: Opportunities and Challenges." This talk is slightly different from the others as the focus was more on AI, so if you are wondering what my take on AI is (or my current research), you can read the abstract to the talk below and also find a link to the recording of it.

Abstract: Urban areas now provide homes for more people than ever before, and with more and more people living in cities achieving sustainable cities is crucial for the betterment of all. Coinciding with the growth of the world’s population is the growth of artificial intelligence (AI) is which is becoming pervasive in all aspects of our daily lives. In this talk I will discuss how AI is offering us new opportunities when it come studying cities, specifically, through the lens of urban analytics. Urban analytics can be broadly defined a set of methods to explore, understand and predict the properties of cities. Through a series of examples, I will highlight how AI especially through the use of multimodal large language models (LLMs) is offering accessible methods for geographic information extraction and modeling of cities. I will showcase how AI can improve the granularity of urban data collection while at the same time provides more advanced GIS tools to practitioners in a more accessible and user-friendly way. However, AI alone is not the panacea when it comes to archiving urban sustainability and many challenges exist and the talk with conclude with these.

If the abstract sounds interesting click here to watch the talk. Also the other keynotes talks are also available online here.

Tuesday, May 13, 2025

Crowdsourcing dust storms utilizing social media data

In the past we have explored how social media can be used to delineate earthquakes, study human-wildlife interactions, understand urban morphology, urban smells or locating wildfires among many other things.

Keeping with the last topic (i.e., locating things), in a new paper published in GeoJournal entitled "Crowdsourcing dust storms in the United States utilizing social media data," Stuart Evans, Festus Adegbola and myself explore how we can use X (formerly Twitter) and Flickr to source observations of windblown dust.

As such the paper demonstrates how social media data can act as supplementary source for dust events monitoring and captures the seasonal trends of such events. Furthermore, the paper highlights the potential of using crowdsourced data for the often overlooked field of dust monitoring that has substantial health and economic impacts.

If this sounds of interest, below we provide the abstract to the paper along with some figures which showcase our methodology and comparison with National Weather Service dust advisories and VIIRS satellite data. At the bottom of the post, you can find the full reference to the paper along with a link to it.

Abstract:

Dust storms and other dust events are natural phenomena characterized by strong winds carrying large amounts of fine particles which have significant environmental and human impacts. However, capturing the occurrence of such phenomena is a challenge. Previous studies have limitations due to available data, especially regarding short-lived, intense dust storms and events that are not captured by observing stations and satellite instruments. In recent years, the advent of social media platforms has provided a unique opportunity to access vast amounts of crowdsourced data. This paper explores the utilization of Flickr and X (Twitter) data to study dust event occurrences within the United States and their correlation with National Weather Service (NWS) advisories. The work ascertains the reliability of using crowdsourced data as a supplementary source for dust events monitoring. Our analysis of Flickr and X indicates that the Southwest region is most susceptible to dust events, with Arizona leading in the highest number of occurrences. On the other hand, the Great Plains show a scarcity of crowdsourced data related to dust events, which can be attributed to the sparsely populated nature of the region. Furthermore, seasonal analysis reveals that dust events are prevalent during the Summer months followed by Spring. These results are consistent with previous traditional studies that did not use social media of dust occurrences in the U.S., and Flickr-identified images of dust events show substantial co-occurrence with regions of NWS dust warnings. This paper highlights the potential of using crowdsourced data for the often overlooked field of dust monitoring that has substantial health and economic impacts.

Keywords: Dust storms, Crowdsourcing, Social media, Weather.

Flowchart of our workflow

Selected posts retrieved from X showing active dust events.

Selected images retrieved from Flickr showing active dust events.

Map showing the distribution of flickr-identified dust event occurrences, X-identified dust event occurrences, National Weather Service dust advisories, including dust storm (DS) warnings and blowing dust (DU) advisories.

Seasonal cycle of dust events using social media metadata, the National Weather Service advisories, and the VIIRS satellite data.

Examples of social media identified dust events and satellite observations for the same day. Brown shaded pixels indicate locations Suomi-VIIRS observed dust particles. Any VTEC warnings issued by NWS for the location are shown after the date of each dust event, with HWW and DSW indicating High Wind Warning and Dust Storm Warning, respectively.

Full Referece:

Adegbola, F., Crooks, A.T. and Evans, S.M. (2025). Crowdsourcing dust storms in the United States utilizing social media data. GeoJournal, 90(3), pp.1-18. Available at https://doi.org/10.1007/s10708-025-11359-9 (pdf)

Pages

Monday, December 15, 2025

Friday, December 12, 2025

Friday, November 28, 2025

Saturday, November 08, 2025

Thursday, November 06, 2025

Thursday, October 09, 2025

Friday, August 01, 2025

Friday, July 18, 2025

Saturday, July 05, 2025

Monday, June 30, 2025

Saturday, June 21, 2025

Tuesday, May 13, 2025