y0news

#ai-infrastructure News & Analysis

327 articles tagged with #ai-infrastructure. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

AI · Bullish · Hugging Face Blog · Oct 27 · 6/10 · 6
🧠

huggingface_hub v1.0: Five Years of Building the Foundation of Open Machine Learning

Hugging Face releases huggingface_hub v1.0, marking a major milestone after five years of development in open machine learning infrastructure. The release represents the maturation of one of the most important platforms for sharing and collaborating on AI models, datasets, and applications.

AI · Bullish · Hugging Face Blog · Sep 10 · 6/10 · 5
🧠

Fine-tune Any LLM from the Hugging Face Hub with Together AI

Together AI has launched a new feature enabling users to fine-tune any large language model available on the Hugging Face Hub. This development makes custom AI model training more accessible by providing streamlined infrastructure and tooling for developers and researchers.

AI · Bullish · OpenAI News · Jul 23 · 5/10 · 6
🧠

Model ML is helping financial firms rebuild with AI from the ground up

Model ML CEO Chaz Englander discusses how AI-native infrastructure and autonomous agents are transforming financial services workflows as part of an Executive Function series. The company is helping financial firms rebuild their operations using artificial intelligence from the ground up.

AI · Bullish · NVIDIA AI Blog · Jun 11 · 6/10 · 3
🧠

NVIDIA CEO Drops the Blueprint for Europe’s AI Boom

NVIDIA CEO Jensen Huang spoke at GTC Paris alongside VivaTech, emphasizing that Europe is not just adopting AI but actively building AI infrastructure. He highlighted the emergence of a new AI industry that will become part of the core intelligence infrastructure.

AI · Bullish · Hugging Face Blog · Jun 3 · 6/10 · 5
🧠

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

The article discusses improving GPU efficiency by co-locating the vLLM inference engine with training in TRL (Transformer Reinforcement Learning library), so that text generation and gradient updates share the same GPUs instead of occupying separate hardware. The approach aims to maximize GPU utilization and reduce idle compute during online RL fine-tuning.
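As a rough sketch of what the co-located setup looks like in user code — assuming TRL's `GRPOConfig` exposes `use_vllm`, `vllm_mode`, and `vllm_gpu_memory_utilization` options; names and defaults may differ across TRL versions, so check the docs for yours:

```python
from trl import GRPOConfig

# Co-locate the vLLM engine with training: generation and gradient
# updates share the same GPUs instead of reserving a separate vLLM
# server ("server" mode), so no GPU sits idle between phases.
config = GRPOConfig(
    output_dir="grpo-colocate",
    use_vllm=True,
    vllm_mode="colocate",              # vs. "server" (dedicated vLLM process)
    vllm_gpu_memory_utilization=0.3,   # fraction reserved for the inference engine
)
```

The memory-utilization fraction is the key knob: the trainer and the inference engine split each GPU's memory, so it must be low enough to leave room for optimizer states and activations.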

AI · Bullish · Hugging Face Blog · May 23 · 6/10 · 6
🧠

Dell Enterprise Hub is all you need to build AI on premises

The article discusses Dell's Enterprise Hub as a comprehensive solution for building AI infrastructure on-premises. This represents Dell's strategic positioning in the growing enterprise AI market by offering integrated hardware and software solutions for organizations looking to deploy AI capabilities locally rather than relying solely on cloud services.

AI · Bullish · OpenAI News · May 7 · 6/10 · 5
🧠

OpenAI’s response to the Department of Energy on AI infrastructure

OpenAI has submitted a response to the Department of Energy regarding AI infrastructure development in the United States. The response emphasizes the critical importance of infrastructure for AI advancement and outlines how the US can maintain its competitive position in the global AI race.

AI · Bullish · Hugging Face Blog · Apr 16 · 6/10 · 7
🧠

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

The article discusses scheduling the two phases of Large Language Model (LLM) inference — prefill (processing the whole prompt in one batched pass) and decode (generating output one token at a time) — to handle concurrent requests efficiently. Interleaving the phases across requests improves throughput and reduces latency when serving multiple users simultaneously.
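The prefill/decode split can be illustrated with a toy continuous-batching scheduler — a minimal sketch, not how any real serving engine is implemented: each request's prompt is admitted in a single prefill pass, after which every in-flight request advances one decode token per step.

```python
def schedule(requests):
    """Toy continuous-batching scheduler.

    requests: list of (request_id, n_decode_tokens). Prefill is modelled
    as admitting one request per step (its whole prompt processed in a
    single batched pass); decode then advances every in-flight request
    by exactly one token per step.
    Returns (completion_order, total_steps).
    """
    pending = list(requests)   # not yet prefilled
    decoding = {}              # request_id -> tokens left to generate
    completed, steps = [], 0
    while pending or decoding:
        steps += 1
        if pending:
            # Prefill: admit the next waiting request alongside ongoing decodes.
            rid, n = pending.pop(0)
            decoding[rid] = n
        # Decode: each in-flight request emits one token this step.
        for rid in list(decoding):
            decoding[rid] -= 1
            if decoding[rid] == 0:
                completed.append(rid)
                del decoding[rid]
    return completed, steps
```

Because decode steps for all active requests happen in the same pass, a short request admitted late still finishes without waiting for earlier long requests to drain — the property continuous batching exploits.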

AI · Bullish · Hugging Face Blog · Mar 28 · 6/10 · 7
🧠

🚀 Accelerating LLM Inference with TGI on Intel Gaudi

The article discusses accelerating Large Language Model (LLM) inference using Text Generation Inference (TGI) on Intel Gaudi hardware. This represents a technical advancement in AI infrastructure optimization for improved performance and efficiency in LLM deployment.

AI · Neutral · OpenAI News · Mar 26 · 6/10 · 7
🧠

Security on the path to AGI

OpenAI is implementing comprehensive security measures directly into their infrastructure and models as they progress toward artificial general intelligence (AGI). The company emphasizes proactive adaptation to address security challenges on the path to AGI development.

AI · Bullish · OpenAI News · Mar 11 · 5/10 · 7
🧠

New tools for building agents

OpenAI is introducing new tools designed to help developers and enterprises build more useful and reliable AI agents. The announcement marks an evolution of the company's existing platform capabilities, focused on agent-development infrastructure.

AI · Bullish · Hugging Face Blog · Jan 22 · 6/10 · 6
🧠

Hugging Face and FriendliAI partner to supercharge model deployment on the Hub

Hugging Face and FriendliAI have announced a strategic partnership to enhance AI model deployment capabilities on Hugging Face's platform. This collaboration aims to streamline and accelerate the process of deploying machine learning models, making it easier for developers to implement AI solutions.

AI · Bullish · Hugging Face Blog · Aug 8 · 6/10 · 5
🧠

XetHub is joining Hugging Face!

XetHub, a data versioning and collaboration platform, is being acquired by Hugging Face, the leading AI model repository and platform. This acquisition strengthens Hugging Face's data infrastructure capabilities and expands their ecosystem for AI development workflows.

AI · Bullish · Hugging Face Blog · Jul 29 · 6/10 · 5
🧠

Serverless Inference with Hugging Face and NVIDIA NIM

Hugging Face has partnered with NVIDIA to integrate NIM (NVIDIA Inference Microservices) for serverless AI model inference. This collaboration enables developers to deploy and scale AI models more efficiently using NVIDIA's optimized inference infrastructure through Hugging Face's platform.

AI · Bullish · Hugging Face Blog · Jul 9 · 6/10 · 5
🧠

Google Cloud TPUs made available to Hugging Face users

Google Cloud has made its Tensor Processing Units (TPUs) available to Hugging Face users, enabling access to specialized AI hardware for machine learning workloads. This partnership expands computational resources for the AI development community using Hugging Face's platform.

AI · Bullish · OpenAI News · Jun 21 · 6/10 · 5
🧠

OpenAI acquires Rockset

OpenAI has acquired Rockset, a real-time analytics database company. This acquisition strengthens OpenAI's data infrastructure capabilities and could enhance their AI model training and deployment processes.

AI · Bullish · Hugging Face Blog · Mar 22 · 6/10 · 9
🧠

Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval

The article discusses binary and scalar embedding quantization techniques that can significantly reduce computational costs and increase speed for retrieval systems. These methods compress high-dimensional vector embeddings while maintaining retrieval performance, making AI search and recommendation systems more efficient and cost-effective.
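The core idea of binary quantization can be shown in a few lines — a toy, dependency-free sketch, not the actual sentence-transformers implementation: each float32 component collapses to one sign bit (a ~32x memory reduction), and retrieval ranks candidates by Hamming distance, which is just an XOR and a popcount.

```python
def binary_quantize(vec):
    """Quantize a float vector to bits: bit i is 1 if component i > 0,
    packed into a single Python int."""
    bits = 0
    for i, x in enumerate(vec):
        if x > 0:
            bits |= 1 << i
    return bits

def hamming(a, b):
    """Number of differing bits; cheap proxy for embedding distance."""
    return bin(a ^ b).count("1")

def search(query_vec, corpus_bits, top_k=3):
    """Rank quantized corpus vectors by Hamming distance to the query."""
    q = binary_quantize(query_vec)
    order = sorted(range(len(corpus_bits)), key=lambda i: hamming(q, corpus_bits[i]))
    return order[:top_k]
```

Production systems typically use the binary codes only for a fast first-pass shortlist, then rescore the shortlist with the original (or scalar-quantized) embeddings to recover accuracy.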

AI · Bullish · Hugging Face Blog · Oct 4 · 6/10 · 7
🧠

Accelerating over 130,000 Hugging Face models with ONNX Runtime

Microsoft's ONNX Runtime now supports over 130,000 Hugging Face models, providing significant performance improvements for AI model inference. This integration enables faster deployment and execution of popular machine learning models across various hardware platforms.

AI · Bullish · Hugging Face Blog · Sep 19 · 6/10 · 7
🧠

Rocket Money x Hugging Face: Scaling Volatile ML Models in Production

Rocket Money partnered with Hugging Face to address challenges in scaling volatile machine learning models for production environments. The collaboration focuses on implementing robust infrastructure solutions to handle ML model instability and performance variations in real-world applications.

AI × Crypto · Bullish · Hugging Face Blog · Sep 1 · 6/10 · 5
🤖

Fetch Cuts ML Processing Latency by 50% Using Amazon SageMaker & Hugging Face

Fetch reduced machine learning processing latency by 50% by combining Amazon SageMaker with Hugging Face technologies. The improvement strengthens the performance and scalability of Fetch's ML infrastructure in production.

AI · Bullish · Hugging Face Blog · May 31 · 6/10 · 6
🧠

Introducing the Hugging Face LLM Inference Container for Amazon SageMaker

Hugging Face has launched an LLM Inference Container for Amazon SageMaker, enabling easier deployment and scaling of large language models on AWS infrastructure. This integration streamlines the process for developers to host and serve AI models in production environments.

AI · Bullish · Hugging Face Blog · May 23 · 6/10 · 5
🧠

🐶Safetensors audited as really safe and becoming the default

Safetensors, a file format for storing machine learning model weights, has passed an external security audit and is becoming the default serialization format in the Hugging Face ecosystem. Unlike pickle-based formats, loading a safetensors file never executes code, marking improved security standards for AI model distribution and storage.
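What makes the format auditable is its simple layout, which the published spec describes as an 8-byte little-endian header length, a JSON header mapping tensor names to dtype/shape/offsets, then raw tensor bytes. A minimal stdlib sketch (illustrative only, not the official `safetensors` library) builds and parses such a blob:

```python
import json
import struct

def build_safetensors(tensors):
    """Assemble a minimal safetensors blob from {name: (dtype, shape, raw_bytes)}."""
    header, buf, offset = {}, b"", 0
    for name, (dtype, shape, raw) in tensors.items():
        header[name] = {"dtype": dtype, "shape": shape,
                        "data_offsets": [offset, offset + len(raw)]}
        buf += raw
        offset += len(raw)
    hjson = json.dumps(header).encode("utf-8")
    return struct.pack("<Q", len(hjson)) + hjson + buf

def read_header(blob):
    """Parse only the header: declarative JSON, no code execution —
    the property that distinguishes the format from pickle."""
    (n,) = struct.unpack("<Q", blob[:8])
    return json.loads(blob[8:8 + n])
```

Because the header fully describes every tensor's byte range, loaders can also memory-map or lazily read individual tensors without touching the rest of the file.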

AI · Neutral · OpenAI News · Mar 24 · 6/10 · 3
🧠

March 20 ChatGPT outage: Here’s what happened

OpenAI experienced a significant ChatGPT outage on March 20, prompting the company to release findings about the technical bug that caused the disruption. The update provides transparency about the incident and outlines actions taken to prevent similar issues.

Page 11 of 14