y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#remote-sensing News & Analysis

27 articles tagged with #remote-sensing. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

27 articles
AIBearisharXiv – CS AI · May 117/10
🧠

From Clouds to Hallucinations: Atmospheric Retrieval Hijacking in Remote Sensing Vision-Language RAG

Researchers introduce CloudWeb, an adversarial attack that manipulates remote sensing images with realistic cloud and haze patterns to hijack vision-language retrieval systems in multimodal RAG pipelines. The attack achieves significant success rates—increasing weather-related evidence injection from 0.71% to 43.29% on benchmark tests—demonstrating that input-space threats to retrieval stages remain largely undefended in production systems.

🏢 OpenAI
AIBullisharXiv – CS AI · Apr 107/10
🧠

Asking like Socrates: Socrates helps VLMs understand remote sensing images

Researchers introduce RS-EoT (Remote Sensing Evidence-of-Thought), a novel framework that enables vision-language models to reason more effectively about satellite imagery by iteratively seeking visual evidence rather than relying on linguistic patterns. The approach uses a self-play multi-agent system called SocraticAgent and reinforcement learning to address the 'Glance Effect,' where models superficially analyze large-scale remote sensing images, achieving state-of-the-art performance on multiple benchmarks.

AIBullisharXiv – CS AI · Mar 56/10
🧠

GeoSeg: Training-Free Reasoning-Driven Segmentation in Remote Sensing Imagery

Researchers introduce GeoSeg, a zero-shot, training-free framework for AI-driven segmentation of remote sensing imagery that uses multimodal language models for reasoning without requiring specialized training data. The system addresses domain-specific challenges in satellite and aerial image analysis through bias-aware coordinate refinement and dual-route prompting mechanisms.

AINeutralarXiv – CS AI · 4d ago6/10
🧠

Bidirectional Semantic Complementary Tool Retrieval for Remote Sensing Agents

Researchers propose a bidirectional semantic complementary tool retrieval (BSCTR) method to improve how LLM-based agents select appropriate tools for remote sensing tasks. The approach addresses a fundamental mismatch between high-level user queries and detailed tool documentation by enhancing queries with decomposed subtasks and enriching tool descriptions with contextual dependencies, demonstrating improved performance on specialized remote sensing benchmarks.

AINeutralarXiv – CS AI · 4d ago6/10
🧠

Set-Based Transformer for Atmospheric Compensation in Standoff LWIR Hyperspectral Imaging

Researchers present a deep learning framework using set-based transformers to compensate for atmospheric effects in long-wave infrared hyperspectral imaging. The method processes multiple radiance measurements at different distances to estimate transmittance, atmospheric path radiance, and downwelling spectrum with minimal spectral distortion, addressing a historically overlooked challenge in standoff imaging applications.

AINeutralarXiv – CS AI · 4d ago5/10
🧠

PolyBuild: An End-to-End Method for Polygonal Building Contour Extraction from High-Resolution Remote Sensing Images

PolyBuild introduces an end-to-end deep learning method for extracting building polygon contours directly from high-resolution remote sensing images without post-processing. The hybrid CNN-Transformer architecture combines an Initial Contour Generation Module with a Contour Optimization Module to achieve superior performance over existing mask-based and contour-based approaches.

$MATIC
AINeutralarXiv – CS AI · 4d ago5/10
🧠

An Enhanced Geometric-Spectral Feature Learning Framework for Airborne Multispectral Point Cloud Classification

Researchers present an enhanced machine learning framework for classifying airborne multispectral point cloud data by combining geometric and spectral features through dual-stream attention mechanisms. The method addresses challenges in high-dimensional data processing and sample imbalance, demonstrating improved classification accuracy on new benchmark datasets.

AINeutralarXiv – CS AI · 5d ago5/10
🧠

A Mechanism-Coupled Split Window Network for Medium- to High-Resolution Land Surface Temperature Retrieval

Researchers propose PCD-Net, a neural network framework that combines physics-based split window algorithms with machine learning to improve land surface temperature retrieval from satellite thermal infrared data. The approach adaptively learns dynamic coefficients for atmospheric correction, addressing limitations of traditional fixed-coefficient methods and enhancing generalization across diverse environmental conditions.

AINeutralarXiv – CS AI · Jun 56/10
🧠

ATT-CR: Adaptive Triangular Transformer for Cloud Removal

Researchers introduce ATT-CR, a Transformer-based model that improves cloud removal in remote sensing images by reducing computational complexity and filtering cloudy pixel interference. The innovation combines Triangular Attention with lower computational costs (O(N)) and a Feature Selected Gating Module to distinguish between valid and invalid features, addressing scalability limitations in existing Transformer approaches.

AINeutralarXiv – CS AI · Jun 56/10
🧠

CangLing-KnowFlow: A Unified Knowledge-and-Flow-fused Agent for Comprehensive Remote Sensing Applications

Researchers introduce CangLing-KnowFlow, an AI agent framework designed to automate complex remote sensing and Earth observation tasks across diverse applications. The system combines a knowledge base of 1,008 expert-validated workflows with dynamic error recovery and continuous learning capabilities, outperforming baseline models by 4% or more on standardized benchmarks.

AINeutralarXiv – CS AI · Jun 46/10
🧠

Optical-Guided Neural Collapse for SAR Few-Shot Class Incremental Learning

Researchers propose an optical-guided neural collapse framework for SAR few-shot class incremental learning that addresses data scarcity and catastrophic forgetting by transferring geometric structure from optical imagery to SAR domain. The method achieves superior performance on benchmark datasets while maintaining better feature compactness and inter-class separability compared to existing FSCIL approaches.

AINeutralarXiv – CS AI · Jun 46/10
🧠

LaVIDE: Language-Prompted Satellite Change Detection via Map-Image Alignment

Researchers introduce LaVIDE, a novel AI framework that uses language as a bridge to detect changes between satellite maps and updated imagery, overcoming semantic gaps between high-level map data and low-level image details. The approach achieves significant performance improvements across four benchmarks and offers practical applications for rapid map updating in urban planning and disaster assessment.

AINeutralarXiv – CS AI · Jun 26/10
🧠

DarkVesselNet: Multi-Modal Remote Sensing and Trajectory Reasoning for Dark Vessel Detection

DarkVesselNet is a multi-modal AI system that detects unregistered vessels by combining satellite radar and optical imagery with AIS trajectory data and anomaly detection algorithms. The open-source framework addresses maritime surveillance challenges and is available as both a Python package and public Hugging Face interface.

🏢 Hugging Face
AINeutralarXiv – CS AI · Jun 26/10
🧠

CAFOSat: A Strongly Annotated Dataset for Infrastructure-Aware CAFO Mapping Using High-Resolution Imagery

Researchers introduce CAFOSat, a large-scale annotated dataset containing over 45,000 image patches for mapping Concentrated Animal Feeding Operations across the United States using high-resolution satellite imagery. The dataset combines AI-assisted annotation, human verification, and infrastructure-level labeling to address challenges in automated CAFO detection, benchmarking multiple deep learning models for improved agricultural monitoring capabilities.

AINeutralarXiv – CS AI · Jun 26/10
🧠

LALE: Lightweight-Transformer Architecture for Land-Cover Estimation

Researchers introduce LALE, a lightweight transformer architecture for remote sensing image segmentation that achieves strong efficiency-performance trade-offs by separating high-resolution local feature processing (via ConvMixer) from low-resolution global context modeling (via transformers). The approach demonstrates that a 1.6M parameter model can match near-SOTA performance while requiring 4.5x fewer parameters and 17x fewer computational operations.

AINeutralarXiv – CS AI · May 296/10
🧠

Coarse-to-Fine Domain Incremental Learning with Attentive Distillation for Mining Footprint Segmentation in Multispectral Imagery

Researchers introduce MineC2FNet, a deep learning framework that leverages abundant coarse-grained remote sensing data to improve fine-grained mining footprint segmentation in multispectral imagery. The approach uses domain incremental learning with attentive distillation to bridge the gap between coarse and fine datasets, addressing a critical gap in environmental monitoring of global mining operations.

AINeutralarXiv – CS AI · May 286/10
🧠

FLORO: A Multimodal Geospatial Foundation Model for Ecological Remote Sensing Across Sensors and Scales

FLORO is a multimodal geospatial foundation model that learns from diverse remote sensing data across multiple sensor types and resolutions with minimal pretraining data. Despite using significantly smaller datasets than competing models, FLORO demonstrates strong transfer learning performance on ecological and environmental applications, achieving competitive results on scene classification, segmentation, and regression tasks.

AINeutralarXiv – CS AI · May 126/10
🧠

WATCH: Wide-Area Archaeological Site Tracking for Change Detection

Researchers introduce WATCH, a satellite-based framework using foundation models to detect disturbances at archaeological sites across months and years. The system combines three approaches—temporal embedding distance, self-supervised change detection, and weakly supervised learning—achieving up to 92.5% accuracy within three-month tolerance windows when monitoring 1,943 Afghan sites and cross-validating in Syria, Turkey, Pakistan, and Egypt.

AINeutralarXiv – CS AI · May 126/10
🧠

Built Environment Reasoning from Remote Sensing Imagery Using Large Vision--Language Models

Researchers are using large language models combined with remote sensing imagery to analyze built environments for smart city applications, evaluating models like InternVL and Qwen for tasks including design suggestions, constructability assessment, and risk identification. The study demonstrates that multimodal AI systems can effectively process satellite imagery at multiple scales to support urban planning and infrastructure decision-making.

AINeutralarXiv – CS AI · May 116/10
🧠

DPG-CD: Depth-Prior-Guided Cross-Modal Joint 2D-3D Change Detection

Researchers introduce DPG-CD, a deep learning framework that detects both 2D semantic and 3D structural changes in urban environments by fusing multi-temporal satellite imagery with Digital Surface Model data. The method addresses the challenge of combining different data modalities to enable high-frequency urban monitoring and disaster assessment without requiring expensive frequent 3D data collection.

AINeutralarXiv – CS AI · May 116/10
🧠

LithoBench: Benchmarking Large Multimodal Models for Remote-Sensing Lithology Interpretation

LithoBench introduces a comprehensive benchmark dataset for evaluating large multimodal models on remote-sensing lithology interpretation, containing 10,000 expert-annotated instances across cognitive levels from identification to reasoning. The research reveals significant gaps in current vision-language models' ability to handle knowledge-intensive geological tasks, highlighting the challenges of applying general-purpose AI to specialized domain expertise.

AIBullisharXiv – CS AI · Mar 116/10
🧠

Grounding Synthetic Data Generation With Vision and Language Models

Researchers introduce ARAS400k, a large-scale remote sensing dataset containing 400k images (100k real, 300k synthetic) with segmentation maps and descriptions. The study demonstrates that combining real and synthetic data consistently outperforms training on real data alone for semantic segmentation and image captioning tasks.

AIBullisharXiv – CS AI · Mar 36/108
🧠

GRAD-Former: Gated Robust Attention-based Differential Transformer for Change Detection

Researchers introduce GRAD-Former, a novel AI framework for detecting changes in satellite imagery that outperforms existing methods while using fewer computational resources. The system uses gated attention mechanisms and differential transformers to more efficiently identify semantic differences in very high-resolution satellite images.

AINeutralarXiv – CS AI · Mar 36/104
🧠

Near--Real-Time Conflict-Related Fire Detection in Sudan Using Unsupervised Deep Learning

Researchers developed a lightweight AI model using unsupervised deep learning to detect conflict-related fires in Sudan within 24-30 hours using commercially available satellite imagery. The Variational Auto-Encoder (VAE) approach outperformed traditional methods in identifying burn signatures from 4-band Planet Labs satellite data at 3-meter resolution.

$CRV$NEAR
Page 1 of 2Next →