AIBullisharXiv – CS AI · Jun 197/10
🧠Researchers released SARLO-80, a large-scale dataset combining very-high-resolution synthetic aperture radar (SAR) imagery, aligned optical images, and natural-language descriptions across 2,500 worldwide scenes. The dataset addresses a critical gap in multimodal AI training by preserving complex-valued SAR measurements and native acquisition geometry, enabling more physically grounded foundation models for Earth observation applications.
🏢 Hugging Face
AIBullisharXiv – CS AI · Jun 107/10
🧠Earth-OneVision is a 2 billion-parameter remote sensing multimodal large language model that unifies six sensor modalities (optical, SAR, infrared, multispectral, temporal, and video) and performs nine task categories through a single framework. The model achieves competitive or superior performance compared to larger models (4B-72B parameters) on multiple benchmarks, supported by a new 34M QA pair dataset spanning cross-sensor fusion applications.
AIBearisharXiv – CS AI · May 117/10
🧠Researchers introduce CloudWeb, an adversarial attack that manipulates remote sensing images with realistic cloud and haze patterns to hijack vision-language retrieval systems in multimodal RAG pipelines. The attack achieves significant success rates—increasing weather-related evidence injection from 0.71% to 43.29% on benchmark tests—demonstrating that input-space threats to retrieval stages remain largely undefended in production systems.
🏢 OpenAI
AIBullisharXiv – CS AI · Apr 107/10
🧠Researchers introduce RS-EoT (Remote Sensing Evidence-of-Thought), a novel framework that enables vision-language models to reason more effectively about satellite imagery by iteratively seeking visual evidence rather than relying on linguistic patterns. The approach uses a self-play multi-agent system called SocraticAgent and reinforcement learning to address the 'Glance Effect,' where models superficially analyze large-scale remote sensing images, achieving state-of-the-art performance on multiple benchmarks.
AIBullisharXiv – CS AI · Mar 56/10
🧠Researchers introduce GeoSeg, a zero-shot, training-free framework for AI-driven segmentation of remote sensing imagery that uses multimodal language models for reasoning without requiring specialized training data. The system addresses domain-specific challenges in satellite and aerial image analysis through bias-aware coordinate refinement and dual-route prompting mechanisms.
AINeutralarXiv – CS AI · 4d ago6/10
🧠Researchers analyze how vision-language models perform zero-shot remote sensing tasks across multiple datasets and find that textual design choices critically impact performance. The study reveals that semantically rich LLM-generated descriptions don't consistently outperform simpler template-based descriptions due to noise in text embeddings, but lightweight query embedding calibration effectively improves results.
AINeutralarXiv – CS AI · Jun 196/10
🧠Researchers introduce RS-Neg, the first benchmark for evaluating negation comprehension in Remote Sensing Multimodal Large Language Models, revealing significant limitations in understanding what is absent or false. They propose NeFo, a test-time learning method that improves negation understanding using just 5% of unlabeled samples, addressing a critical gap for real-world emergency response applications.
AINeutralarXiv – CS AI · Jun 105/10
🧠Researchers propose CSI-Net, a deep learning architecture that improves change detection in remote sensing images by effectively integrating spatial and spectral information while suppressing noise from unchanged areas. The model demonstrates superior performance across multiple satellite imagery datasets, advancing capabilities for applications like environmental monitoring and urban planning.
AINeutralarXiv – CS AI · Jun 106/10
🧠Researchers have developed MSI-Net, a deep learning model for detecting building damage in post-earthquake satellite imagery, and introduced the TUE-CD dataset based on the Turkey earthquake. The solution addresses the challenge of analyzing remote sensing images with short time intervals and varying imaging angles to support emergency response operations.
AINeutralarXiv – CS AI · Jun 96/10
🧠Researchers propose a bidirectional semantic complementary tool retrieval (BSCTR) method to improve how LLM-based agents select appropriate tools for remote sensing tasks. The approach addresses a fundamental mismatch between high-level user queries and detailed tool documentation by enhancing queries with decomposed subtasks and enriching tool descriptions with contextual dependencies, demonstrating improved performance on specialized remote sensing benchmarks.
AINeutralarXiv – CS AI · Jun 96/10
🧠Researchers present a deep learning framework using set-based transformers to compensate for atmospheric effects in long-wave infrared hyperspectral imaging. The method processes multiple radiance measurements at different distances to estimate transmittance, atmospheric path radiance, and downwelling spectrum with minimal spectral distortion, addressing a historically overlooked challenge in standoff imaging applications.
AINeutralarXiv – CS AI · Jun 95/10
🧠PolyBuild introduces an end-to-end deep learning method for extracting building polygon contours directly from high-resolution remote sensing images without post-processing. The hybrid CNN-Transformer architecture combines an Initial Contour Generation Module with a Contour Optimization Module to achieve superior performance over existing mask-based and contour-based approaches.
$MATIC
AINeutralarXiv – CS AI · Jun 95/10
🧠Researchers present an enhanced machine learning framework for classifying airborne multispectral point cloud data by combining geometric and spectral features through dual-stream attention mechanisms. The method addresses challenges in high-dimensional data processing and sample imbalance, demonstrating improved classification accuracy on new benchmark datasets.
AINeutralarXiv – CS AI · Jun 85/10
🧠Researchers propose PCD-Net, a neural network framework that combines physics-based split window algorithms with machine learning to improve land surface temperature retrieval from satellite thermal infrared data. The approach adaptively learns dynamic coefficients for atmospheric correction, addressing limitations of traditional fixed-coefficient methods and enhancing generalization across diverse environmental conditions.
AINeutralarXiv – CS AI · Jun 56/10
🧠Researchers introduce ATT-CR, a Transformer-based model that improves cloud removal in remote sensing images by reducing computational complexity and filtering cloudy pixel interference. The innovation combines Triangular Attention with lower computational costs (O(N)) and a Feature Selected Gating Module to distinguish between valid and invalid features, addressing scalability limitations in existing Transformer approaches.
AINeutralarXiv – CS AI · Jun 56/10
🧠Researchers introduce CangLing-KnowFlow, an AI agent framework designed to automate complex remote sensing and Earth observation tasks across diverse applications. The system combines a knowledge base of 1,008 expert-validated workflows with dynamic error recovery and continuous learning capabilities, outperforming baseline models by 4% or more on standardized benchmarks.
AINeutralarXiv – CS AI · Jun 46/10
🧠Researchers propose an optical-guided neural collapse framework for SAR few-shot class incremental learning that addresses data scarcity and catastrophic forgetting by transferring geometric structure from optical imagery to SAR domain. The method achieves superior performance on benchmark datasets while maintaining better feature compactness and inter-class separability compared to existing FSCIL approaches.
AINeutralarXiv – CS AI · Jun 46/10
🧠Researchers introduce LaVIDE, a novel AI framework that uses language as a bridge to detect changes between satellite maps and updated imagery, overcoming semantic gaps between high-level map data and low-level image details. The approach achieves significant performance improvements across four benchmarks and offers practical applications for rapid map updating in urban planning and disaster assessment.
AINeutralarXiv – CS AI · Jun 26/10
🧠DarkVesselNet is a multi-modal AI system that detects unregistered vessels by combining satellite radar and optical imagery with AIS trajectory data and anomaly detection algorithms. The open-source framework addresses maritime surveillance challenges and is available as both a Python package and public Hugging Face interface.
🏢 Hugging Face
AINeutralarXiv – CS AI · Jun 26/10
🧠Researchers introduce CAFOSat, a large-scale annotated dataset containing over 45,000 image patches for mapping Concentrated Animal Feeding Operations across the United States using high-resolution satellite imagery. The dataset combines AI-assisted annotation, human verification, and infrastructure-level labeling to address challenges in automated CAFO detection, benchmarking multiple deep learning models for improved agricultural monitoring capabilities.
AINeutralarXiv – CS AI · Jun 26/10
🧠Researchers introduce LALE, a lightweight transformer architecture for remote sensing image segmentation that achieves strong efficiency-performance trade-offs by separating high-resolution local feature processing (via ConvMixer) from low-resolution global context modeling (via transformers). The approach demonstrates that a 1.6M parameter model can match near-SOTA performance while requiring 4.5x fewer parameters and 17x fewer computational operations.
AINeutralarXiv – CS AI · May 296/10
🧠Researchers introduce MineC2FNet, a deep learning framework that leverages abundant coarse-grained remote sensing data to improve fine-grained mining footprint segmentation in multispectral imagery. The approach uses domain incremental learning with attentive distillation to bridge the gap between coarse and fine datasets, addressing a critical gap in environmental monitoring of global mining operations.
AINeutralarXiv – CS AI · May 286/10
🧠FLORO is a multimodal geospatial foundation model that learns from diverse remote sensing data across multiple sensor types and resolutions with minimal pretraining data. Despite using significantly smaller datasets than competing models, FLORO demonstrates strong transfer learning performance on ecological and environmental applications, achieving competitive results on scene classification, segmentation, and regression tasks.
AINeutralarXiv – CS AI · May 126/10
🧠Researchers introduce WATCH, a satellite-based framework using foundation models to detect disturbances at archaeological sites across months and years. The system combines three approaches—temporal embedding distance, self-supervised change detection, and weakly supervised learning—achieving up to 92.5% accuracy within three-month tolerance windows when monitoring 1,943 Afghan sites and cross-validating in Syria, Turkey, Pakistan, and Egypt.
AINeutralarXiv – CS AI · May 126/10
🧠Researchers are using large language models combined with remote sensing imagery to analyze built environments for smart city applications, evaluating models like InternVL and Qwen for tasks including design suggestions, constructability assessment, and risk identification. The study demonstrates that multimodal AI systems can effectively process satellite imagery at multiple scales to support urban planning and infrastructure decision-making.