y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#machine-learning News & Analysis

2519 articles tagged with #machine-learning. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

2519 articles
AIBullishOpenAI News · May 36/104
🧠

AI safety via debate

A new AI safety technique is proposed that involves training AI agents to debate topics with each other, with humans serving as judges to determine winners. This approach aims to improve AI safety through adversarial training and human oversight.

AIBullishOpenAI News · Apr 186/105
🧠

Evolved Policy Gradients

Researchers have released Evolved Policy Gradients (EPG), an experimental metalearning approach that evolves the loss function of AI learning agents to enable faster training on new tasks. The method allows agents to generalize beyond their training data, successfully performing basic tasks in novel scenarios they weren't specifically trained for.

AIBullishOpenAI News · Feb 266/106
🧠

Ingredients for robotics research

OpenAI is releasing eight simulated robotics environments and a Baselines implementation of Hindsight Experience Replay, tools developed for their robotics research. These environments have been used to train models that successfully work on physical robots, and the company is also releasing research requests for the robotics community.

AIBullishOpenAI News · Feb 155/105
🧠

Interpretable machine learning through teaching

Researchers have developed a machine learning method that enables AIs to teach each other using examples that are also interpretable by humans. The approach automatically identifies the most informative examples to convey concepts, such as selecting optimal images to represent dogs, and has shown effectiveness in teaching both artificial intelligence systems.

AINeutralOpenAI News · Jan 315/104
🧠

Requests for Research 2.0

OpenAI has released a new batch of seven unsolved research problems as part of their Requests for Research 2.0 initiative. This represents OpenAI's continued effort to crowdsource solutions to challenging problems they've encountered in their AI research work.

AIBullishOpenAI News · Dec 66/107
🧠

Block-sparse GPU kernels

A company has released highly-optimized GPU kernels for block-sparse neural network architectures that can run orders of magnitude faster than existing solutions like cuBLAS or cuSPARSE. These kernels have achieved state-of-the-art results in text sentiment analysis and generative modeling applications.

AIBullishOpenAI News · Oct 266/106
🧠

Learning a hierarchy

Researchers have developed a hierarchical reinforcement learning algorithm that learns high-level actions to efficiently solve complex tasks requiring thousands of timesteps. The algorithm was successfully applied to navigation problems, where it discovered high-level actions for walking and crawling in different directions, enabling rapid mastery of new navigation tasks.

AINeutralLil'Log (Lilian Weng) · Sep 286/10
🧠

Anatomize Deep Learning with Information Theory

Professor Naftali Tishby applied information theory to analyze deep neural network training, proposing the Information Bottleneck method as a new learning bound for DNNs. His research identified two distinct phases in DNN training: first representing input data to minimize generalization error, then compressing representations by forgetting irrelevant details.

AIBullishOpenAI News · Sep 146/108
🧠

Learning to model other minds

OpenAI has released LOLA (Learning with Opponent-Learning Awareness), an algorithm that enables AI agents to model and adapt to other learning agents. The system can develop collaborative strategies like tit-for-tat in game theory scenarios while maintaining self-interest.

AINeutralOpenAI News · Aug 35/107
🧠

Gathering human feedback

RL-Teacher is an open-source implementation that enables AI training through occasional human feedback instead of traditional hand-crafted reward functions. This technique was developed as a step toward creating safer AI systems and addresses reinforcement learning challenges where rewards are difficult to specify.

AINeutralOpenAI News · Jun 86/106
🧠

Learning to cooperate, compete, and communicate

Multiagent environments where AI agents compete for resources are identified as crucial stepping stones toward AGI development. These environments provide natural curriculum learning through competitive dynamics and create unstable equilibriums that drive continuous improvement, though they require significantly more research to master.

AIBullishOpenAI News · May 246/104
🧠

OpenAI Baselines: DQN

OpenAI has open-sourced OpenAI Baselines, an internal project to reproduce reinforcement learning algorithms with performance matching published results. The initial release includes DQN (Deep Q-Network) and three of its variants, with more algorithms planned for future releases.

AIBullishOpenAI News · Apr 16/106
🧠

Spam detection in the physical world

A breakthrough AI system has been developed that can detect spam in physical environments, representing the first of its kind to be trained entirely through simulation and successfully deployed on a physical robot. This advancement demonstrates the potential for AI to bridge the gap between digital and physical world applications.

AIBearishOpenAI News · Feb 246/105
🧠

Attacking machine learning with adversarial examples

Adversarial examples are specially crafted inputs designed to fool machine learning models into making incorrect predictions, functioning like optical illusions for AI systems. The article explores how these attacks work across different mediums and highlights the challenges in defending ML systems against such vulnerabilities.

AIBullishOpenAI News · Jan 305/107
🧠

Team update

OpenAI announces their team has grown to 45 people, focusing on advancing AI capabilities through novel idea validation, new software systems, and machine learning deployment on robots. This represents continued scaling of one of the leading AI research organizations.

AIBullishOpenAI News · Nov 156/106
🧠

OpenAI and Microsoft

OpenAI announces a partnership with Microsoft to run most of their large-scale AI experiments on Microsoft's Azure cloud platform. This collaboration strengthens the existing relationship between the two companies in AI infrastructure and development.

AIBullishOpenAI News · Nov 96/107
🧠

RL²: Fast reinforcement learning via slow reinforcement learning

The article presents RL², a meta-learning approach that uses slow reinforcement learning to enable fast adaptation to new tasks. This method allows AI agents to quickly learn new behaviors by leveraging prior training experience across multiple related tasks.

AIBullishOpenAI News · Oct 115/104
🧠

Transfer from simulation to real world through learning deep inverse dynamics model

The article discusses research on transferring AI models from simulation environments to real-world applications through deep inverse dynamics modeling. This approach aims to bridge the sim-to-real gap in robotics and AI systems by learning how to map actions to outcomes in physical environments.

AIBullishOpenAI News · Aug 295/105
🧠

Infrastructure for deep learning

Deep learning infrastructure quality acts as a multiplier for research progress and development. The current open-source ecosystem provides tools that enable anyone to build high-quality deep learning infrastructure.

AINeutralOpenAI News · Jun 216/107
🧠

Concrete AI safety problems

Researchers from multiple institutions including Google Brain, Berkeley, and Stanford have published a collaborative paper titled 'Concrete Problems in AI Safety.' The research explores various challenges in ensuring modern machine learning systems operate as intended and addresses safety considerations in AI development.

← PrevPage 69 of 101Next →