- IN41A-06: Pretrain Here: How the Spatial Distribution of Pretraining Data Shapes Remote Sensing Transferability (highlighted)
-
NOLA CC
Author(s):Generic 'disconnected' Message
Amandeep Kaur, Arizona State University (First Author, Presenting Author)
Esther Rolf, University of Colorado at Boulder
Hannah Kerner, Arizona State University
Earth observation satellites collect massive amounts of unlabelled imagery every day, which researchers use to train artificial intelligence (AI) models. These models can help monitor land use, ecosystems, and even estimate population. However, there's currently no clear guideline on where this training data should come from; should it be sampled evenly across the globe, or focused on certain continents or ecosystems, etc? In this study, we tested how the geographic origin of training data affects how well AI models perform on global tasks. We trained several popular Earth observation models using data sampled from different regions and then measured how well they worked on problems like land-use classification, map segmentation, and estimating surface-level data. We found that where the data comes from, its geographic and environmental diversity has a big impact on how useful the models are. Our findings can help researchers build better, more general-purpose models for monitoring the planet.
Scientific DisciplineSuggested ItinerariesNeighborhoodType
Enter Note
Go to previous page in this tab
Session


