110 articles tagged with #gemini. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.
AINeutralarXiv – CS AI · Mar 266/10
🧠Researchers developed PoliticsBench, a new framework to evaluate political bias in large language models through multi-turn roleplay scenarios. The study found that 7 out of 8 major LLMs (Claude, Deepseek, Gemini, GPT, Llama, Qwen) showed left-leaning political bias, while only Grok exhibited right-leaning tendencies.
🧠 Claude🧠 Gemini🧠 Llama
CryptoBearishDecrypt – AI · Mar 106/10
⛓️The Winklevoss twins transferred $130 million worth of Bitcoin to Gemini hot wallets, with blockchain analytics firm Arkham suggesting the move was likely intended for selling. Neither Cameron nor Tyler Winklevoss has publicly commented on the purpose of these large transfers.
$BTC🧠 Gemini
AIBullisharXiv – CS AI · Mar 66/10
🧠Research shows that multi-agent LLM systems using models from different vendors (o4-mini, Gemini-2.5-Pro, Claude-4.5-Sonnet) significantly outperform single-vendor teams in clinical diagnosis tasks. Mixed-vendor configurations achieve superior recall and accuracy by combining complementary strengths and reducing shared biases that affect homogeneous model teams.
🧠 Claude🧠 Gemini
AIBullishThe Verge – AI · Mar 46/101
🧠Google's NotebookLM now generates fully animated 'cinematic' video overviews from user research and notes, upgrading from basic narrated slideshows. The feature uses multiple AI models including Gemini 3, Nano Banana Pro, and Veo 3 to create animated visuals and determine narrative style automatically.
AIBullishThe Verge – AI · Mar 45/101
🧠Google is expanding its Canvas workspace feature to all US users within AI Mode in Search, allowing users to create documents, code, and organize plans in a dedicated panel alongside chat. Previously limited to the Gemini app and travel planning, Canvas now supports creative writing and coding tasks with access to real-time search information.
AIBullishTechCrunch – AI · Mar 46/101
🧠Google has rolled out Canvas in AI Mode to all US users, allowing them to create plans, projects, and applications in English. This expansion makes Google's AI-powered creative tool widely available to American users.
AIBullishThe Register – AI · Mar 46/10
🧠Google has integrated its Gemini AI model into Android Studio Panda 2, enabling developers to build Android applications directly from text prompts. This represents a significant advancement in AI-powered development tools, potentially streamlining app creation workflows.
🧠 Gemini
AIBullishTechCrunch – AI · Mar 45/103
🧠CollectivIQ is a startup that aims to improve AI answer accuracy by aggregating responses from multiple AI models including ChatGPT, Gemini, Claude, and Grok simultaneously. The company's approach involves crowdsourcing chatbot responses to provide users with more reliable information by comparing outputs from up to 10 different AI models.
AI × CryptoBullishDecrypt · Mar 46/105
🤖A Bitcoin Policy Institute study reveals that major AI systems including Claude, GPT, Grok, and Gemini show preference for Bitcoin over traditional fiat currencies and stablecoins. This finding suggests AI models may inherently recognize Bitcoin's value proposition when making currency-related decisions.
$BTC
AIBullishCrypto Briefing · Mar 36/101
🧠Google has launched Gemini 3.1 Flash Lite, positioning it as the fastest and most cost-effective model in the Gemini 3 series. The new AI model targets developers with enhanced speed performance, improved benchmarks, and scalable API pricing structure.
AIBullishThe Verge – AI · Mar 36/104
🧠Google is rolling out new Pixel drop features including Gemini AI's ability to perform tasks like ordering groceries and booking rides through apps like Uber and Grubhub. The agentic AI feature allows Gemini to work autonomously in the background while users can supervise or interrupt its actions, currently available on Pixel 10 series devices.
AIBullishGoogle DeepMind Blog · Mar 36/104
🧠Google has announced Gemini 3.1 Flash-Lite, positioning it as the fastest and most cost-efficient model in their Gemini 3 series. The model appears designed for large-scale deployment with optimized performance and reduced operational costs.
AIBearisharXiv – CS AI · Mar 36/106
🧠Researchers compared human survey responses from 420 Silicon Valley developers with synthetic data from five leading LLMs including ChatGPT, Claude, and Gemini. While AI models produced technically plausible results, they failed to capture counterintuitive insights and only replicated conventional wisdom rather than revealing novel findings.
AINeutralarXiv – CS AI · Mar 36/106
🧠Researchers identified Self-Anchoring Calibration Drift (SACD), where large language models show systematic confidence changes when building on their own outputs in multi-turn conversations. Testing Claude Sonnet 4.6, Gemini 3.1 Pro, and GPT-5.2 revealed model-specific patterns, with Claude showing decreasing confidence and significant calibration errors, while GPT-5.2 exhibited opposite behavior in open-ended domains.
$NEAR
AIBearisharXiv – CS AI · Mar 37/105
🧠A systematic audit of 17 shadow APIs used in 187 academic papers reveals widespread deception, with performance divergence up to 47.21% and identity verification failures in 45.83% of tests. These third-party services claim to provide access to frontier LLMs like GPT-5 and Gemini-2.5 but deliver inconsistent outputs, undermining research validity and reproducibility.
AINeutralThe Verge – AI · Mar 27/108
🧠Apple is reportedly asking Google to set up dedicated servers for a new Gemini-powered version of Siri that meets Apple's privacy requirements. This builds on their January partnership announcement where Google's Gemini AI models would help power Apple's upgraded Siri, indicating Apple's increasing reliance on Google's AI infrastructure.
AIBullisharXiv – CS AI · Mar 26/1015
🧠Aletheia, a mathematics research agent powered by Gemini 3 Deep Think, successfully solved 6 out of 10 problems in the inaugural FirstProof challenge. The AI system demonstrated autonomous mathematical problem-solving capabilities, with expert assessments confirming its solutions though some disagreement existed on Problem 8.
AIBullisharXiv – CS AI · Mar 27/1022
🧠Researchers introduce a framework of four strategies to improve large language models' performance in context-aided forecasting, addressing diagnostic tools, accuracy, and efficiency. The study reveals an 'Execution Gap' where models understand context but fail to apply reasoning, while showing 25-50% performance improvements and cost-effective adaptive routing approaches.
AINeutralarXiv – CS AI · Mar 27/1018
🧠Researchers analyzed how large language models express moral judgments when prompted to role-play different personas. The study found that Claude models are most morally robust, while larger models within families tend to be more susceptible to moral shifts through persona conditioning.
AIBearisharXiv – CS AI · Mar 26/1018
🧠Researchers introduce FRIEDA, a new benchmark for testing cartographic reasoning in large vision-language models, revealing significant limitations. The best AI models achieve only 37-38% accuracy compared to 84.87% human performance on complex map interpretation tasks requiring multi-step spatial reasoning.
AIBullisharXiv – CS AI · Feb 276/108
🧠Researchers developed GYWI, a scientific idea generation system that combines author knowledge graphs with retrieval-augmented generation to help Large Language Models generate more controllable and traceable scientific ideas. The system significantly outperforms mainstream LLMs including GPT-4o, DeepSeek-V3, Qwen3-8B, and Gemini 2.5 in metrics like novelty, reliability, and relevance.
AIBullisharXiv – CS AI · Feb 276/106
🧠Researchers developed a hybrid system combining machine learning ensembles with large language models for heart disease prediction, achieving 96.62% accuracy. The study found that traditional ML models (95.78% accuracy) outperformed standalone LLMs (78.9% accuracy), but combining both approaches yielded the best results for clinical decision-support tools.
AIBullishArs Technica – AI · Feb 266/106
🧠Google has launched Nano Banana 2, a new AI image generation model that replaces previous versions and is now available in Gemini. The model represents Google's latest advancement in AI image generation technology.
AIBullishThe Verge – AI · Feb 266/106
🧠Google has launched Nano Banana 2 (Gemini 3.1 Flash Image), bringing advanced AI image generation capabilities previously exclusive to Nano Banana Pro to free users. The new model offers faster, cheaper, and easier complex image generation with real-time information and web search integration.
AIBullishTechCrunch – AI · Feb 266/103
🧠Google has launched Nano Banana 2, a new AI model featuring faster image generation capabilities. The model is being integrated as the default in Google's Gemini app and AI mode, representing a significant update to Google's AI infrastructure.