y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#semantic-segmentation News & Analysis

7 articles tagged with #semantic-segmentation. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

7 articles
AIBullisharXiv โ€“ CS AI ยท Feb 277/108
๐Ÿง 

A Confidence-Variance Theory for Pseudo-Label Selection in Semi-Supervised Learning

Researchers introduce a Confidence-Variance (CoVar) theory framework that improves pseudo-label selection in semi-supervised learning by combining maximum confidence with residual-class variance. The method addresses overconfidence issues in deep networks and demonstrates consistent improvements across multiple datasets including PASCAL VOC, Cityscapes, CIFAR-10, and Mini-ImageNet.

$NEAR
AIBullisharXiv โ€“ CS AI ยท Mar 116/10
๐Ÿง 

Grounding Synthetic Data Generation With Vision and Language Models

Researchers introduce ARAS400k, a large-scale remote sensing dataset containing 400k images (100k real, 300k synthetic) with segmentation maps and descriptions. The study demonstrates that combining real and synthetic data consistently outperforms training on real data alone for semantic segmentation and image captioning tasks.

AINeutralarXiv โ€“ CS AI ยท Apr 74/10
๐Ÿง 

TreeGaussian: Tree-Guided Cascaded Contrastive Learning for Hierarchical Consistent 3D Gaussian Scene Segmentation and Understanding

TreeGaussian introduces a new framework for 3D scene understanding that uses tree-guided cascaded contrastive learning to better capture hierarchical semantic relationships in complex 3D environments. The method addresses limitations in existing 3D Gaussian Splatting approaches by implementing structured learning across object-part hierarchies and improving segmentation consistency.

AINeutralarXiv โ€“ CS AI ยท Mar 34/103
๐Ÿง 

How Do Optical Flow and Textual Prompts Collaborate to Assist in Audio-Visual Semantic Segmentation?

Researchers introduce Stepping Stone Plus (SSP), a novel framework that combines optical flow and textual prompts to improve audio-visual semantic segmentation. The method outperforms existing approaches by using motion dynamics for moving sound sources and textual descriptions for stationary objects, with a visual-textual alignment module for better cross-modal integration.

AIBullishHugging Face Blog ยท Jan 194/105
๐Ÿง 

Universal Image Segmentation with Mask2Former and OneFormer

This article discusses Universal Image Segmentation techniques using Mask2Former and OneFormer architectures. These are advanced computer vision models that can perform multiple segmentation tasks in a unified framework, representing significant progress in AI image understanding capabilities.