AIBullisharXiv – CS AI · 5d ago7/10
🧠Researchers introduce YUBI, a finger-aligned gripper that improves upon existing data collection systems for robotic manipulation by enabling more ergonomic, intuitive bimanual control. The team released an unprecedented 8,434-hour dataset across 1.20M episodes and demonstrated that policies trained on YUBI data transfer successfully across multiple robot platforms, advancing the development of robotic foundation models.
AINeutralThe Verge – AI · 4d ago6/10
🧠Google is implementing a new 'Search Services History' setting that will save images, audio, video, and files from Google Lens, Search Live, voice searches, and Translate for AI training purposes. Users can disable this feature, but the change reflects Google's broader effort to collect multimodal data for training its AI models.
GeneralBearishDaily Hodl · Jun 66/10
📰Google has agreed to settle an $8.25 million class action lawsuit alleging the tech giant illegally collected personal information from children under 13 without parental consent through its AdMob platform. The 2023 lawsuit, filed in California's Northern District, highlights ongoing regulatory scrutiny of big tech companies' data practices involving minors and their compliance with child privacy laws.
AINeutralThe Verge – AI · May 296/10
🧠AI training startup Shift is offering free home cleaning services in New York with plans to expand to other cities, but requires video footage of cleaners performing domestic tasks. The company aims to collect training data for robotics companies developing household automation technology, exemplifying how AI firms are increasingly monetizing everyday human activities.
AINeutralArs Technica – AI · May 296/10
🧠A startup is offering free home cleaning services to customers willing to wear head cameras during the process, with footage used to train robots for future automation. This represents an emerging trend where companies incentivize data collection from human workers to develop AI and robotics capabilities.
AINeutralThe Verge – AI · May 296/10
🧠AI training startup Shift is offering free home cleaning services with a novel catch: it will record cleaners to generate training data for robot development. The company argues that the value of this footage sufficiently subsidizes the service, creating a barter economy where homeowners receive clean homes while Shift obtains valuable AI training material.
CryptoBullishEthereum Foundation Blog · Aug 145/103
⛓️The Ethereum Foundation's Ecodev Coordinators team awarded $557,660 in grants to 11 recipients in their Data Collection grant round. The selected projects were chosen for their innovative approaches to advancing data collection and analysis within the Ethereum ecosystem.
$ETH
AINeutralCrypto Briefing · Mar 265/10
🧠Jake Loosararian discusses the critical importance of data collection in robotics for achieving efficiency, while highlighting concerns about Nvidia's market dominance potentially limiting hardware diversity. The analysis emphasizes determinism as a crucial factor for future robotics advancements, particularly in energy and defense sectors where AI-driven systems manage critical infrastructure.
🏢 Nvidia
AINeutralLil'Log (Lilian Weng) · Feb 54/10
🧠The article discusses the critical importance of high-quality human-labeled data for training modern deep learning models, particularly for classification tasks and RLHF labeling used in LLM alignment. Despite the recognized value of quality data, there's a notable preference in the ML community for model development work over data collection and annotation work.
AIBullishOpenAI News · Oct 114/105
🧠Typeform is integrating GPT-3.5 and GPT-4 to transform traditional online forms into dynamic, conversational data collection experiences. This represents a practical application of AI technology to enhance user interaction and data gathering processes.