$ timeahead_
all sourcesAhead of AI (Sebastian Raschka)Anthropic NewsApple Machine Learning ResearchArs Technica AIAWS Machine Learning BlogCerebras BlogCohere BlogCrewAI BlogDeepSeek BlogDistill.pubfast.ai BlogFireworks AI BlogGoogle AI BlogGoogle Cloud AI BlogGoogle DeepMind BlogGroq BlogHaystack (deepset) BlogHugging Face BlogImport AI (Jack Clark)LangChain BlogLangFuse BlogLil'Log (Lilian Weng)LlamaIndex BlogMeta AI BlogMicrosoft AutoGen BlogMicrosoft Research BlogMistral AI NewsMIT Technology ReviewModal Blogn8n BlogNathan Lambert (RLHF)NVIDIA Developer BlogOllama BlogOpenAI BlogPerplexity AI BlogPyTorch BlogReplicate BlogSimon Willison BlogTensorFlow BlogThe Batch (DeepLearning.AI)The GradientThe Verge AITogether AI BlogVentureBeat AIvLLM BlogWeights & Biases BlogWired AIxAI (Grok) Blog
allapiagentsframeworkshardwareinframodelopen sourcereleaseresearchtutorial
★ TOP STORY[ WA ]Tutorial·2d ago

At 'AI Coachella,' Stanford Students Line Up to Learn From Silicon Valley Royalty

As thousands of influencers descended on southern California earlier this month for the annual Coachella Music Festival, a very Silicon Valley program dubbed “AI Coachella” was taking shape a few hundred miles north in Palo Alto. The class, CS 153, is one of Stanford’s buzziest offerings this semester, and like the music festival, it features a star-studded lineup of celebrities—in this case, not pop artists, but Big Tech CEOs. The course is co-taught by Anjney Midha, a former Andreessen Horowitz general partner, and Michael Abbott, Apple’s former VP of engineering for cloud services. The list of guest lecturers reads like a Signal group chat many VCs would pay to join: OpenAI CEO Sam Altman, Nvidia CEO Jensen Huang, Microsoft CEO Satya Nadella, AMD CEO Lisa Su, Anthropic philosopher Amanda Askell, and White House Senior Policy Advisor for AI Sriram Krishnan,…

Wired AIread →
▲ trending · last 48hview all →
[ATA]Ars Technica AI· 5 articlesvisit →
2d ago
Greenhouse gases from data center boom could outpace entire nations
New gas projects linked to just 11 data center campuses around the US have the potential to create more greenhouse gases than the country of Morocco emitted in 2024. Emissions estimates from air permit documents examined by WIRED show that these natural gas projects—which are being built to power data centers to serve some of the US’s most powerful AI companies, including OpenAI, Meta, Microsoft, and xAI—have the potential to emit more than 129 million tons of greenhouse gases per year. As tech companies race to secure massive power deals to build out hundreds of data centers across the country, these projects represent just the tip of the iceberg when it comes to the potential climate cost of the AI boom. The infrastructure on this list of large natural gas projects reviewed by WIRED is being developed to largely bypass…
2dInfraby Molly Taft, wired.com
2d ago
US accuses China of “industrial-scale” AI theft. China says it’s “slander.”
The US is preparing to crack down on China’s allegedly “industrial-scale theft of American artificial intelligence labs’ intellectual property,” the Financial Times reported Thursday. Since the launch of DeepSeek—a Chinese model that OpenAI claimed was trained using outputs from its models—other AI firms have accused global rivals of using a method called distillation to steal their IP. In January, Google claimed that “commercially motivated” actors not limited to China attempted to clone its Gemini AI chatbot by promoting the model more than 100,000 times in bids to train cheaper copycats. The next month, Anthropic accused Chinese firms DeepSeek, Moonshot, and MiniMax of using the same tactic to generate “over 16 million exchanges with Claude through approximately 24,000 fraudulent accounts.” Also in February, OpenAI confirmed that most attacks it saw originated from China. For the US, these distillation attacks supposedly threaten…
2dHardware#claude#geminiby Ashley Belanger
3d ago
Indian med student rakes in thousands with AI-generated MAGA hottie
Like many medical school students, Sam was broke. The 22-year-old aspiring orthopedic surgeon from northern India got some money from his parents, but he says he spent most of it subsidizing his licensing exams, and he’s still saving up to hopefully emigrate to the US after graduation. So he started searching for ways to make additional money online. Sam, who requested a pseudonym to avoid jeopardizing his medical career and immigration status, tried a few things, with varying degrees of legitimacy and success. He made YouTube shorts and sold study notes to other med students. It wasn’t until he started scrolling through his Instagram feed that he landed on an idea: Why not make an AI-generated girl using Google Gemini’s Nano Banana Pro and sell bikini photos of her online? But when Sam started posting generic photos of a beautiful,…
3dResearch#geminiby Ej Dickson, wired.com
3d ago
Google unveils two new TPUs designed for the "agentic era"
Most of the companies that have fully committed to building AI models are gobbling up every Nvidia AI accelerator they can get, but Google has taken a different approach. Most of its cloud AI infrastructure is based on its line of custom Tensor processing units (TPUs). After announcing the seventh-gen Ironwood TPU in 2025, the company has moved on to the eighth-gen version, but it’s not just a faster iteration of the same chip. The new TPUs come in two flavors, providing Google and its customers with an AI platform that is faster and more efficient, the company says. Google is pushing the idea that the “agent era” is fundamentally different from the AI systems that came before, necessitating a new approach to the hardware. So engineers have devised the TPU8t (for training) and the TPU 8i (for inference). Before…
3dHardware#agents#inference#trainingby Ryan Whitwam
3d ago
Anthropic tested removing Claude Code from the Pro plan
Anthropic caused a stir among developers with what appeared to be a surprise change to its pricing plan: The company signaled that Claude Code, the popular agentic development tool, would no longer be available to subscribers on the $20-per-month Pro plan. Users took to Reddit and X to point out that Anthropic’s pricing page for Claude explicitly showed Claude Code as not supported in the Pro plan. (It remained in the $100/month+ Max plan.) Some new users signing up for Pro subscriptions were unable to access Claude Code. Meanwhile, existing subscribers saw no interruption. After speculation and frustration spread, Anthropic’s head of growth, Amol Avasare, took to social media to clarify that this was a “small test on ~2% of new prosumer signups.” As for the reasoning, he explained: When we launched Max a year ago, it didn’t include Claude…
3dModel#claude#codingby Samuel Axon
[AWS]AWS Machine Learning Blog· 6 articlesvisit →
2d ago
Applying multimodal biological foundation models across therapeutics and patient care
Artificial Intelligence Applying multimodal biological foundation models across therapeutics and patient care Healthcare and life sciences decision making increasingly relies on multimodal data to diagnose diseases, prescribe medicine and predict treatment outcomes, develop and optimize innovative therapies accurately. Traditional approaches analyze fragmented data, such as ‘omics for drug discovery, medical images for diagnostics, clinical trial reports for validation, and electronic health records (EHR) for patient treatment. As a result, decision makers (CxOs, VPs, Directors) often miss critical insights hidden in the relationships between data types. Recent advancements in AI enable you to integrate and analyze these fragmented data streams efficiently to support a more complete understanding of therapeutics and patient care. AWS provides a unified environment for multimodal biological foundation models (BioFMs), enabling you to make more confident, timely decision-making in personalized medicine. This AI system combines biological data, model…
2dInfra#multimodalby Kristin Ambrosini
2d ago
Amazon Quick for marketing: From scattered data to strategic action
Artificial Intelligence Amazon Quick for marketing: From scattered data to strategic action Imagine the following scenario: You’re leading marketing campaigns, creating content, or driving demand generation. Your campaigns are scattered and your insights are buried. By the time you’ve pieced together what’s working, the moment to act has already passed. This isn’t a tools problem because you have plenty of those. It’s a connection problem. Your marketing systems and tools are disconnected, so you spend time moving data between systems instead of improving campaigns or sharing results with your team. Amazon Quick changes how you work. You can set it up in minutes and by the end of the day, you will wonder how you ever worked without it. Quick connects with your applications, tools, and data, creating a personal knowledge graph that learns your priorities, preferences, and network. It…
2dby Zach Conley
3d ago
Company-wise memory in Amazon Bedrock with Amazon Neptune and Mem0
Artificial Intelligence Company-wise memory in Amazon Bedrock with Amazon Neptune and Mem0 This post is cowritten by Shawn Tsai from TrendMicro. Delivering relevant, context-aware responses is important for customer satisfaction. For enterprise-grade AI chatbots, understanding not only the current query but also the organizational context behind it is key. Company-wise memory in Amazon Bedrock, powered by Amazon Neptune and Mem0, provides AI agents with persistent, company-specific context—enabling them to learn, adapt, and respond intelligently across multiple interactions. TrendMicro, one of the largest antivirus software companies in the world, developed the Trend’s Companion chatbot, so their customers can explore information through natural, conversational interactions (learn more). TrendMicro aimed to enhance its AI chatbot service to deliver personalized, context-aware support for enterprise customers. The chatbot needed to retain conversation history for continuity, reference company-specific knowledge at scale, and ensure that memory remained…
3dTutorialby Shawn Tsai
3d ago
Get to your first working agent in minutes: Announcing new features in Amazon Bedrock AgentCore
Artificial Intelligence Get to your first working agent in minutes: Announcing new features in Amazon Bedrock AgentCore Getting an agent running has always meant solving a long list of infrastructure problems before you can test whether the agent itself is any good. You wire up frameworks, storage, authentication, and deployment pipelines, and by the time your agent handles its first real task, you’ve spent days on infrastructure instead of agent logic. We built AgentCore from the ground up to help developers focus on building agent logic instead of backend plumbing, working with frameworks and models they already use, including LangGraph, LlamaIndex, CrewAI, Strands Agents, and more. Today, we’re introducing new capabilities that further streamline the agent building experience, removing the infrastructure barriers that slow teams down at every stage of agent development from the first prototype through production deployment. Go…
3dInfra#agentsby Madhu Parthasarathy
3d ago
Amazon SageMaker AI now supports optimized generative AI inference recommendations
Artificial Intelligence Amazon SageMaker AI now supports optimized generative AI inference recommendations Organizations are racing to deploy generative AI models into production to power intelligent assistants, code generation tools, content engines, and customer-facing applications. But deploying these models to production remains a weeks-long process of navigating GPU configurations, optimization techniques, and manual benchmarking, delaying the value these models are built to deliver. Today, Amazon SageMaker AI supports optimized generative AI inference recommendations. By delivering validated, optimal deployment configurations with performance metrics, Amazon SageMaker AI keeps your model developers focused on building accurate models, not managing infrastructure. We evaluated several benchmarking tools and chose NVIDIA AIPerf, a modular component of NVIDIA Dynamo, because it exposes detailed, consistent metrics and supports diverse workloads out of the box. Its CLI, concurrency controls, and dataset options give us the flexibility to iterate quickly and…
3dInfra#inference#codingby Mona Mona
3d ago
Cost-effective multilingual audio transcription at scale with Parakeet-TDT and AWS Batch
Artificial Intelligence Cost-effective multilingual audio transcription at scale with Parakeet-TDT and AWS Batch Many organizations are archiving large media libraries, analyzing contact center recordings, preparing training data for AI, or processing on-demand video for subtitles. When data volumes grow significantly, managed automatic speech recognition (ASR) service costs can quickly become the primary constraint on scalability. To address this cost-scalability challenge, we use the NVIDIA Parakeet-TDT-0.6B-v3 model, deployed through AWS Batch on GPU-accelerated instances. Parakeet-TDT’s Token-and-Duration Transducer architecture simultaneously predicts text tokens and their duration to intelligently skip silence and redundant processing. This helps achieve inference speeds orders of magnitude faster than real-time. By paying only for brief bursts of compute rather than the full length of your audio, you can transcribe at scale for fractions of a cent per hour of audio based on the benchmarks described in this post.…
3dTutorial#rag#inference#multimodalby Gleb Geinke
[CB]Cerebras Blog· 29 articlesvisit →
2d ago
Figma - MultiAgents April 16, 2026
Everything is easier now. I have been toying around with agent orchestration for a while now. I’m currently running 10-20 agents around the clock.AI agents are now capable of bringing my ideas to life. Like many developers, I’ve been feeling the token anxiety. I can do much more now than ever before, and every time I have a spare minute I want to kick off another agent session. - I see a cool product I don’t want to pay for? Codex will build it for me. - I have a silly idea I want to see come to life? Codex will build it for me. - I get mildly annoyed doing the same thing over and over? Codex pls. If you have an army of infinitely patient, intelligent, and helpful agents waiting for your next command, why shouldn’t we take…
3d ago
Debugging Dead MoE Models: A Step-by-Step Guide August 19, 2025
Cerebras is the go-to platform for fast and effortless AI training. Learn more at cerebras.ai.
3d ago
MoE at Scale: Making Sparse Models Fast on Real Hardware September 03, 2025
Cerebras is the go-to platform for fast and effortless AI training. Learn more at cerebras.ai.
3d ago
Cerebras and Docker Compose: Building Isolated AI Code Environments September 17, 2025
AI Developers run Cerebras inference inside Docker containers, with Docker Compose, to create safe environments for AI-generated code and inference speed
3d ago
Cerebras CS-3 vs. Groq LPU September 19, 2025
Cerebras is the go-to platform for fast and effortless AI training. Learn more at cerebras.ai.
3d ago
Cerebras CS-3 vs. Nvidia DGX B200 Blackwell September 19, 2025
Cerebras is the go-to platform for fast and effortless AI training. Learn more at cerebras.ai.
3d ago
Cerebras API Certification Partner Program for LLM API Providers September 22, 2025
Cerebras inference - the fastest inference API for generative AI.
3dInfra#inference
3d ago
The Fastest AI Datacenters will run on Cerebras: Meet OKC September 22, 2025
Cerebras inference - the fastest inference API for generative AI.
3dInfra#inference
3d ago
Cerebras Inference: Now Available via Pay Per Token October 13, 2025
Cerebras is the go-to platform for fast and effortless AI training. Learn more at cerebras.ai.
3d ago
MoE Math Demystified: What Does 8x7B Actually Mean? October 14, 2025
Cerebras is the go-to platform for fast and effortless AI training. Learn more at cerebras.ai.
3d ago
REAP: One-Shot Pruning for Trillion-Parameter Mixture-of-Experts Models October 16, 2025
Cerebras is the go-to platform for fast and effortless AI training. Learn more at cerebras.ai.
3d ago
Building Instant RL Loops with Meta Llama Tools and Cerebras October 27, 2025
Cerebras is the go-to platform for fast and effortless AI training. Learn more at cerebras.ai.
3d ago
Cerebras October 2025 Highlights November 03, 2025
Cerebras is the go-to platform for fast and effortless AI training. Learn more at cerebras.ai.
3d ago
Cerebras February 2026 Highlights November 03, 2025
Cerebras is the go-to platform for fast and effortless AI training. Learn more at cerebras.ai.
3d ago
OpenAI GPT-OSS 120B Benchmarked – NVIDIA Blackwell vs. Cerebras November 06, 2025
Nvidia Blackwell is upgrade over Hopper, with top speed of GPU inference by 2-3x and leapfrogging small-chip AI competitors. Cerebras outperforms Nvidia,
3d ago
The world’s fastest GLM-4.6 – now available on Cerebras November 18, 2025
Cerebras is the go-to platform for fast and effortless AI training. Learn more at cerebras.ai.
3d ago
Scaling Code-Repair Agents with Reinforcement Learning: Extending OpenHands for Real-World Repositories November 24, 2025
Cerebras is the go-to platform for fast and effortless AI training. Learn more at cerebras.ai.
3d ago
Scaling SWE Agent Data Collection with Dockerized Environments for Execution November 24, 2025
Cerebras is the go-to platform for fast and effortless AI training. Learn more at cerebras.ai.
3d ago
Rox × Cerebras: Real-time speed for agentic sales workflows November 25, 2025
Cerebras is the go-to platform for fast and effortless AI training. Learn more at cerebras.ai.
3d ago
Cerebras at NeurIPS 2025: Nine Papers From Pretraining to Inference December 04, 2025
Cerebras is the go-to platform for fast and effortless AI training. Learn more at cerebras.ai.
3d ago
Jais 2: A Blueprint for Sovereign AI December 09, 2025
Cerebras is the go-to platform for fast and effortless AI training. Learn more at cerebras.ai.
3d ago
Case Study - Cognition x Cerebras December 10, 2025
Powered by Cerebras Inference, Cognition’s SWE-1.5 and the SWE-grep family deliver frontier-level coding performance up to 13x faster than general-purpose models—keeping developers in flow while they explore codebases, ship features, and debug complex systems
3d ago
Thinking Inside the Box: The Implicit Chain Transformer for Efficient State Tracking December 12, 2025
Cerebras is the go-to platform for fast and effortless AI training. Learn more at cerebras.ai.
3d ago
2026: Fast Inference Finds its Groove January 06, 2026
Cerebras is the go-to platform for fast and effortless AI training. Learn more at cerebras.ai.
3d ago
GLM-4.7: Frontier intelligence at record speed — now available on Cerebras January 08, 2026
Cerebras is the go-to platform for fast and effortless AI training. Learn more at cerebras.ai.
3d ago
This new model is smarter than Sonnet 4.5…and 20X faster? January 08, 2026
Cerebras is the go-to platform for fast and effortless AI training. Learn more at cerebras.ai.
3d ago
OpenAI Partners with Cerebras to Bring High-Speed Inference to the Mainstream January 14, 2026
Cerebras is the go-to platform for fast and effortless AI training. Learn more at cerebras.ai.
3d ago
StackAI × Cerebras: enabling the fastest inference for enterprise AI agents January 28, 2026
Cerebras is the go-to platform for fast and effortless AI training. Learn more at cerebras.ai.
3d ago
Fast inference is going mainstream — the Cerebras ecosystem is scaling access January 28, 2026
Cerebras is the go-to platform for fast and effortless AI training. Learn more at cerebras.ai.
[CB]CrewAI Blog· 2 articlesvisit →
3d ago
How a Healthcare Provider Cuts Nurse Intake Work by 80% with Agentic AI Discover how healthcare automates patient intake using agentic AI, cutting nurse intake time by up to 80% to improve efficiency and patient experience. Alex Clay Apr 22, 2026
How a Healthcare Provider Cuts Nurse Intake Work by 80% with Agentic AI Discover how healthcare automates patient intake using agentic AI, cutting nurse intake time by up to 80% to improve efficiency and patient experience. Manual Intake Overloads Nurses and Patients Three nurses spend four hours daily on patient intake at many healthcare providers. These clinicians spend a third of their shift reading, assessing insurance eligibility, and routing forms instead of delivering care. It exhausts staff and costs money. When eligibility checks lag or forms are misrouted, patient satisfaction falls and costs rise. Intake Bottlenecks Waste Thousands of Nursing Hours On average, nurses at large health systems spend 4 hours each day on intake forms. When handling thousands of patients, this wastes thousands of nursing hours weekly. Manual workflows cause insurance verification errors above 20%, triggering denials and delayed…
3dAgents#agents
3d ago
Latest
How a Healthcare Provider Cuts Nurse Intake Work by 80% with Agentic AI Discover how healthcare automates patient intake using agentic AI, cutting nurse intake time by up to 80% to improve efficiency and patient experience. How One E-Commerce Giant Automates Returns and Refunds with Agentic AI E-commerce returns automation cuts costs and boosts refunds with multi-agent AI handling classification, verification, and response drafting efficiently. Agent Harnesses Are Dead. Long Live Agent Harnesses. I said it at a Dev Day 2025 (DeepLearning.AI conference) last year: Frameworks are cheap. A few people in the audience looked uncomfortable. But I think the statement aged well. Now it's not just frameworks. Harnesses are getting the same treatment, and the cycle is only getting How a Global CPG Automates Supply Chain Demand Forecasting with Agentic AI Discover how CPG supply chains use agentic AI…
3dAgents#agents
[HF]Hugging Face Blog· 1 articlesvisit →
3d ago
Gemma 4 VLA Demo on Jetson Orin Nano Super
Gemma 4 VLA Demo on Jetson Orin Nano Super You speak → Parakeet STT → Gemma 4 → [Webcam if needed] → Kokoro TTS → Speaker Press SPACE to record, SPACE again to stop. This is a simple VLA: the model decides on its own whether to act based on the context of what you asked, no keyword triggers, no hardcoded logic. If your question needs Gemma to open her eyes, she'll decide to take a photo, interpret it, and answer you with that context in mind. She's not describing the picture, she's answering your actual question using what she saw. And honestly? It's pretty impressive that this runs on a Jetson Orin Nano. :) Get the code The full script for this tutorial lives on GitHub, in my Google_Gemma repo next to the Gemma 2 demos: 👉 github.com/asierarranz/Google_Gemma Grab…
3dTutorial#coding
[MRB]Microsoft Research Blog· 1 articlesvisit →
3d ago
AutoAdapt: Automated domain adaptation for large language models
At a glance - Problem: Adapting large language models to specialized, high-stakes domains is slow, expensive, and hard to reproduce. - What we built: AutoAdapt automates planning, strategy selection (e.g., RAG vs. fine-tuning), and tuning under real deployment constraints. - How it works: A structured configuration graph maps the full scope of the adaptation process, an agentic planner selects and sequences the right steps, and a budget-aware optimization loop (AutoRefine) refines the process within defined constraints. - Why it matters: The result is faster, automated, more reliable domain adaptation that turns weeks of manual iteration into repeatable pipelines. Deploying large language models (LLMs) in real-world, high-stakes settings is harder than it should be. In high-stakes settings like law, medicine, and cloud incident response, performance and reliability can quickly break down because adapting models to domain-specific requirements is a slow and…
3dInfra#rag#agents#fine-tuningby Sidharth Sinha, Anson Bastos, Xuchao Zhang, Akshay Nambi, Rujia Wang, Chetan Bansal
[MTR]MIT Technology Review· 8 articlesvisit →
2d ago
Will fusion power get cheap? Don’t count on it.
Will fusion power get cheap? Don’t count on it. New research suggests that cost declines could be slow for the technology. Fusion power could provide a steady, zero-emissions source of electricity in the future—if companies can get plants built and running. But a new study suggests that even if that future arrives, it might not come cheap. Technologies tend to get less expensive over time. Lithium-ion batteries are now about 90% cheaper than they were in 2013. But historically, different technologies tend to go through this curve at different rates. And the cost of fusion might not sink as quickly as the prices of batteries or solar. It’s tricky to make any predictions about the cost of a technology that doesn’t exist yet. But when there’s billions of dollars of public and private funding on the line, it’s worth considering…
2dResearchby Casey Crownhart
2d ago
The Download: introducing the Nature issue
The Download: introducing the Nature issue Plus: Trump signaled he’s open to reversing the Anthropic ban. This is today's edition of The Download, our weekday newsletter that provides a daily dose of what's going on in the world of technology. Introducing: the Nature issue When we talk about “nature,” we usually mean something untouched by humans. But little of that world exists today. From microplastics in rainforest wildlife to artificial light in the Arctic Ocean, human influence now reaches every corner of Earth. In this context, what even is nature? And should we employ technology to try to make the world more “natural”? In our new Nature issue, MIT Technology Review grapples with these questions. We investigate birds that can’t sing, wolves that aren’t wolves, and grass that isn’t grass. We look for the meaning of life under Arctic ice,…
2dReleaseby Thomas Macaulay
3d ago
Los Angeles is finally going underground
Los Angeles is finally going underground The long-under-construction subway arrives in a famous part of town. Los Angeles deserves its reputation as the quintessential car city—the rhythms of its 2,200 square miles are dictated by wide boulevards and concrete arcs of freeways. But it once had a world-class rail transit system, and for the last three decades, the city has been rebuilding a network of trolleys and subways. In May, a new four-mile segment with three new subway stations will open along Wilshire Boulevard, a key east-west corridor that connects downtown LA to the Pacific Ocean. What today can be an hours-long drive through a busy, museum-packed stretch of the city will be, if all goes well, a 25-minute train ride. The existence of subway stops in this part of town—known as Miracle Mile—is a technological triumph over geography and…
3dby Adam Rogers
3d ago
There is no nature anymore
There is no nature anymore No part of the globe is free of human fingerprints. Should we deploy technology to change it back? When people talk about “nature,” they’re generally talking about things that aren’t made by human beings. Rocks. Reefs. Red wolves. But while there is plenty of God’s creation to go around, it is hard to think of anything on Earth that human hands haven’t affected. In the Brazilian rainforest, scientists have found microplastics in the bellies of animals ranging from red howler monkeys to manatees. In remotest Yakutia, where much of the earth remains untrodden by human feet, the carbon in the sky above melts the permafrost below. In the Arctic Ocean, artificial light from ship traffic—on the rise as the polar ice cap melts away—now disrupts the nightly journey of zooplankton to the ocean surface, one…
3dby Mat Honan
3d ago
One town’s scheme to get rid of its geese
One town’s scheme to get rid of its geese Public officials in one California burgh spent nearly $400,000 on tech to flush out waterfowl. “Pull over!” I order my brother one sunny February afternoon. Our target is in sight: a gaggle of Canada geese, pecking at grass near the dog park. As I approach, tiptoeing over their grayish-white poop, I notice that one bird wears a white cuff around its slender black neck. It’s a GPS tracker—part of a new tech-centered campaign to drive the geese out of my hometown of Foster City, California. About 300 geese live in this sleepy Bay Area suburb, equal to nearly 1% of our human population—and some say this town isn’t big enough for the both of us. Goose poop notoriously blanketed our middle school’s lawn, and the birds have hassled residents for generations.…
3dby Annika Hom
3d ago
3 things Michelle Kim is into right now
3 things Michelle Kim is into right now MIT Technology Review’s editorial fellow shares what she’s been thinking about lately. Isegye Idol If you thought K-pop was weird, virtual idols—humans who perform as anime-style digital characters via motion capture—will blow your mind. My favorite is a girl group called Isegye Idol, created by Woowakgood, a Korean VTuber (a streamer who likewise performs as a digital persona). Isegye Idol’s six members are anonymous, which seems to let them deploy a rare breed of honesty and humor. They play games (League of Legends, Go, Minecraft), chitchat, and perform kitschy music that’s somewhere between anime soundtrack and video-game score. It’s very DIY—and very intimate. And the group’s wild popularity speaks to the mood of Gen Z South Koreans, famously lonely and culturally adrift—struggling to find work, giving up on dating, trying to find…
3d#multimodalby Michelle Kim
3d ago
AI needs a strong data fabric to deliver business value
Sponsored AI needs a strong data fabric to deliver business value A modern data fabric makes it possible to turn existing enterprise knowledge into a trusted foundation for AI. In partnership withSAP Artificial intelligence is moving quickly in the enterprise, from experimentation to everyday use. Organizations are deploying copilots, agents, and predictive systems across finance, supply chains, human resources, and customer operations. By the end of 2025, half of companies used AI in at least three business functions, according to a recent survey. But as AI becomes embedded in core workflows, business leaders are discovering that the biggest obstacle is not model performance or computing power but the quality and the context of the data on which those systems rely. AI essentially introduces a new requirement: Systems must not only access data — they must understand the business context behind…
3dResearch#codingby MIT Technology Review Insights
3d ago
The Download: introducing the 10 Things That Matter in AI Right Now
The Download: introducing the 10 Things That Matter in AI Right Now Plus: An unauthorized group has reportedly accessed Anthropic’s Mythos. This is today's edition of The Download, our weekday newsletter that provides a daily dose of what's going on in the world of technology. Introducing: 10 Things That Matter in AI Right Now What actually matters in AI right now? It’s getting harder to tell amid the constant launches, hype, and warnings. To cut through the noise, MIT Technology Review’s reporters and editors have distilled years of analysis into a new essential guide: the 10 Things That Matter in AI Right Now. The list builds on our annual 10 Breakthrough Technologies, but takes a wider view of the ideas, topics, and research shaping AI, spotlighting the trends and breakthroughs shaping the world. We’ll be unpacking one item from the…
3dReleaseby Thomas Macaulay
[NV]NVIDIA Developer Blog· 4 articlesvisit →
2d ago
Winning a Kaggle Competition with Generative AI–Assisted Coding
In March 2026, three LLM agents generated over 600,000 lines of code, ran 850 experiments, and helped secure a first-place finish in a Kaggle playground competition. Success in modern machine learning competitions is increasingly defined by how quickly you can generate, test, and iterate on ideas. LLM agents, combined with GPU acceleration, dramatically compress this loop. Historically, two bottlenecks have limited this experimentation: - How quickly you can write code for new experiments. - How quickly you can execute those experiments. GPUs and libraries like NVIDIA cuDF, NVIDIA cuML, XGBoost, and PyTorch have largely solved the second problem. LLM agents now address the first problem—unlocking a new scale of rapid, iterative experimentation. This blog post describes how I used LLM agents to accelerate the discovery of the most performant tabular data prediction solutions. Case study: Kaggle Playground churn prediction The…
2dResearch#codingby Chris Deotte
3d ago
Advancing Emerging Optimizers for Accelerated LLM Training with NVIDIA Megatron
Higher-order optimization algorithms such as Shampoo have been effectively applied in neural network training for at least a decade. These methods have achieved significant success more recently when applied to leading LLMs. In particular, Muon (MomentUm Orthogonalized by Newton-Schulz) was used to train some of today’s best open source models, including Kimi K2 and GLM-5. This post explains how NVIDIA provides comprehensive support for Muon and other cutting-edge emerging optimizers and the technologies enabling them to train large-scale models. Muon training performance on NVIDIA GB300 NVL72 Table 1 summarizes training throughput of the Kimi K2 and Qwen3 30B models with Muon and the AdamW optimizer on the NVIDIA GB300 NVL72 system. With the technologies that will be introduced in the next section, the results show that there is a very small training performance loss using the Muon optimizer compared to…
3d ago
Scaling the AI-Ready Data Center with NVIDIA RTX PRO 4500 Blackwell Server Edition and NVIDIA vGPU 20
AI integration is redefining mainstream enterprise applications, from productivity software like Microsoft Office to more complex design and engineering tools. This shift requires the modern data center to move beyond single-purpose silos. For developers, gaining access to dedicated GPU compute can often be a bottleneck. Virtual machines (VMs) solve part of this challenge by providing secure, isolated, and scalable environments tailored to specific project needs. However, dedicating an entire physical GPU to a single VM is highly inefficient for mixed or lightweight workloads. This is where NVIDIA Multi-Instance GPU (MIG) technology becomes essential. With MIG, a single physical GPU is partitioned at the hardware level into multiple fully independent instances, each with guaranteed memory, cache, and compute cores. For a development team, this ensures predictable, uncompromising Quality of Service (QoS). This means that multiple developers can simultaneously train AI models,…
3dHardware#gpuby Phoebe Lee
3d ago
Simplify Sparse Deep Learning with Universal Sparse Tensor in nvmath-python
In a previous post, we introduced the Universal Sparse Tensor (UST), enabling developers to decouple a tensor’s sparsity from its memory layout for greater flexibility and performance. We’re excited to announce the integration of the UST into nvmath-python v0.9.0 to accelerate sparse scientific and deep learning applications. This post provides a walkthrough of key UST features, implementation details, and performance overview, including: - Zero-cost interoperability: Data-movement-free conversion with PyTorch, SciPy, and CuPy. - Custom formats: Define novel sparsity schemes. - Polymorphic operations: Sparsity-agnostic functions automatically use optimized kernels or generate custom sparse code—eliminating the need for manual coding of new formats. - PyTorch injection: Easily inject UST performance benefits into existing PyTorch models. - Transparent caching: Avoid JIT/LTO recompilation and replanning—amortizing overhead over subsequent repeated execution of the same operation. Tensor format DSL The UST describes common (e.g., COO, CSR,…
3dTutorial#codingby Aart J.C. Bik
[OAI]OpenAI Blog· 16 articlesvisit →
2d ago
GPT-5.5 Bio Bug Bounty
GPT‑5.5 Bio Bug Bounty Testing universal jailbreaks for biorisks in GPT‑5.5 As part of our ongoing efforts to strengthen our safeguards for advanced AI capabilities in biology, we’re introducing a Bio Bug Bounty for GPT‑5.5 and accepting applications. We’re inviting researchers with experience in AI red teaming, security, or biosecurity to try to find a universal jailbreak that can defeat our five-question bio safety challenge. - Model in scope: GPT‑5.5 in Codex Desktop only. - Challenge: Identify one universal jailbreaking prompt to successfully answer all five bio safety questions from a clean chat without prompting moderation. - Rewards: - $25,000 to the first true universal jailbreak to clear all five questions. - Smaller awards may be granted for partial wins at our discretion. - Timeline: Applications open April 23, 2026 with rolling acceptances, and close on June 22, 2026. Testing…
2dModel#safety
2d ago
Top 10 uses for Codex at work
Top 10 uses for Codex at work Try these 10 prompts to move real work forward with dashboards, decks, workflows, and more. You’ve seen what Codex can do. Now it’s time to put it to work. These use cases show how to use Codex to do real work: create deliverables, pull together context from multiple tools, take action on real inputs, and move tasks forward faster. Start with the generic prompt if you want something you can use right away, then use the customization suggestions and example to make it your own. You start the day by bouncing between your calendar, messages, email, and notes, trying to figure out what matters most. Codex can pull that context together, keep watch for changes, and turn it into one clear brief so you spend less time triaging and more time acting on…
2dHardware#agents
2d ago
Codex settings
Codex settings Make Codex work the way you want, with fewer interruptions. You can access settings from the menu in the bottom left corner of Codex. For your first few tasks, focus on a few key settings: personalization, prevent sleep, detail level, and appearance. General > Prevent sleep while running keeps your computer awake while Codex is running. This is useful for longer tasks. If your computer goes to sleep, Codex may stop working. General > Detail level controls how much information Codex shows while it is working. Coding mode shows the specific commands Codex is executing. If this is more information than you need, switch to Default to keep your conversation cleaner. Personalization works a lot like personalization in ChatGPT. You can decide whether you want Codex to speak to you in a friendly tone or a direct tone.…
2dTutorial#agents
2d ago
How to get started with Codex
How to get started with Codex Tips to set up Codex, create your first project, and start completing real tasks. Start by downloading the Codex desktop app and signing in with your ChatGPT account. Once you open Codex, create your first thread. A thread is like a chat in ChatGPT: a space where you go back and forth with Codex to accomplish a task. You can create a standalone thread, but most of the time you’ll want to work inside a project. A project is connected to a folder on your computer: Tip: To keep things simple, create a folder on your computer named Codex. Inside that Codex folder, you can have a separate folder for each project. If you want Codex to work with specific files for a project, just drag them into the folder. If not, you can…
2dTutorial
2d ago
Working with Codex
Working with Codex Learn how to set up your Codex workspace and start working with threads and projects. When you open Codex, you’ll see a few core elements: a sidebar menu, projects, settings, and a chat window. You don’t need to understand everything right away, but we’ll cover the basics here. The sidebar is where you navigate between threads, projects, and tools. Most of your work will begin by creating a new thread. When you’re using Codex, think of a “thread” the same way you would think of a “chat” in ChatGPT. You can have a thread which stands on its own, or a thread which is nested within a project. Select New thread to begin. You can select an existing project to associate it with, create a new project, or leave it as a standalone conversation. Search to find…
2dTutorial
2d ago
Plugins and skills
Plugins and skills Plugins and skills help Codex do more specific kinds of work. Plugins help Codex connect to other tools and sources of information. For example, a plugin might help Codex reference files in Google Drive, scan your email inbox, or work with information from another tool you use. Plugins can be simple and useful right away. If you already have the information you need in a connected plugin, you can ask Codex to use it instead of copying and pasting everything into the thread. To access plugins, select plugins in the top left corner of Codex. From there, you can see plugins that are recommended or already installed, browse the plugins library, or create a new plugin. Creating a new plugin usually requires more technical expertise than creating a skill. A skill is like a playbook Codex can…
2dTutorial#agents
2d ago
Automations
Automations Run recurring tasks automatically using schedules and triggers in Codex. Codex can automatically run tasks on a schedule. This makes Codex proactive. Instead of waiting for you to come back and ask for an update, Codex can return at the scheduled time, do the work, and surface the result for you to review. This is useful for recurring work, like preparing for the day, reviewing what changed, checking for updates, summarizing recent activity, or creating a weekly report. For example, you might use a thread automation to: - Write a weekly review every Friday - Create a morning brief from yesterday’s work - Summarize new files added to a folder - Clean up a weekly data export - Check for missing or inconsistent information - Create a recurring project status update Some automations can also return to the same…
2dTutorial#agents
2d ago
What is Codex?
What is Codex? Understand what Codex is and how it fits into your work Codex is an AI agent that you can delegate real work to. ChatGPT is great for asking questions, brainstorming, and drafting in conversation. Codex is designed for a different kind of task—it can work across files, tools, and repeatable workflows to help move work forward. A simple way to think about it: ChatGPT helps you think through the work, while Codex helps you hand off parts of the work itself. You don’t need to be a developer or working on software to use Codex. It goes beyond coding and is especially useful for tasks that require more than a single answer—like gathering information from multiple sources, creating and updating files, or producing outputs such as documents, slides, and spreadsheets. Codex can connect to tools, take action,…
2dTutorial
2d ago
Introducing GPT-5.5
Update on April 24, 2026: GPT‑5.5 and GPT‑5.5 Pro are now available in the API. The system card has also been updated to describe the additional safeguards that apply. We’re releasing GPT‑5.5, our smartest and most intuitive to use model yet, and the next step toward a new way of getting work done on a computer. GPT‑5.5 understands what you’re trying to do faster and can carry more of the work itself. It excels at writing and debugging code, researching online, analyzing data, creating documents and spreadsheets, operating software, and moving across tools until a task is finished. Instead of carefully managing every step, you can give GPT‑5.5 a messy, multi-part task and trust it to plan, use tools, check its work, navigate through ambiguity, and keep going. The gains are especially strong in agentic coding, computer use, knowledge work,…
2dResearch#codingby OpenAI
2d ago
GPT-5.5 System Card
GPT‑5.5 is a new model designed for complex, real-world work, including writing code, researching online, analyzing information, creating documents and spreadsheets, and moving across tools to get things done. Relative to earlier models, GPT‑5.5 understands the task earlier, asks for less guidance, uses tools more effectively, checks it work and keeps going until it’s done. We subjected the model to our full suite of predeployment safety evaluations and our Preparedness Framework, including targeted red-teaming for advanced cybersecurity and biology capabilities, and collected feedback on real use cases from nearly 200 early-access partners before release. We are releasing GPT‑5.5 with our strongest set of safeguards to date, designed to reduce misuse while preserving legitimate, beneficial uses of advanced capabilities. We generally treat GPT‑5.5’s safety results as strong proxies for GPT‑5.5 Pro, which is the same underlying model using a setting that…
2dModel
2d ago
Lowe’s puts project expertise into every hand
Lowe’s puts project expertise into every hand With OpenAI, Lowe’s brings their Mylow Companion app to all retail associates, applying the same AI foundation behind their customer-facing Mylow virtual advisor. Home improvement projects aren’t simple shopping trips. They're major investments, often involving thousands of dollars, multiple visits, and specialized expertise. “When you’re buying a t-shirt and it doesn’t fit, you just return it. No big deal,” says Seemantini Godbole, EVP, Chief Digital and Information Officer at Lowe’s. “But if you’re renovating a kitchen or redoing your floors, those are expensive decisions. You want to feel confident. And that requires expertise.” Lowe’s Red Vest associates have long helped customers navigate that complexity. But with stores up to 130,000 square feet and tens of thousands of SKUs in store, even seasoned team members can’t know everything. And online shoppers face similar challenges:…
2d
3d ago
Introducing OpenAI Privacy Filter
Today we’re releasing OpenAI Privacy Filter, an open-weight model for detecting and redacting personally identifiable information (PII) in text. This release is part of our broader effort to support a more resilient software ecosystem by providing developers practical infrastructure for building with AI safely, including tools and models that make strong privacy and security protections easier to implement from the start. Privacy Filter is a small model with frontier personal data detection capability. It is designed for high-throughput privacy workflows, and is able to perform context-aware detection of PII in unstructured text. It can run locally, which means that PII can be masked or redacted without leaving your machine. It processes long inputs efficiently, making redaction decisions in a quick, single pass. At OpenAI, we use a fine-tuned version of Privacy Filter in our own privacy-preserving workflows. We developed Privacy…
3dOpen Source#local
3d ago
Workspace agents
Workspace agents Understand, build, and use agents for repeatable work in ChatGPT. Most ChatGPT users already know how to use AI for one-off tasks—like drafting, summarizing, brainstorming, or answering questions. The next phase of AI use is broader and more embedded in day-to-day work. Instead of helping with isolated moments, AI is increasingly being used to support repeatable workflows that depend on shared systems, standard handoffs, consistent outputs, and real-world constraints like timing, accuracy, and process. That’s where workspace agents in ChatGPT fit. They’re designed to be used for repeatable workflows—work you’d otherwise do manually, re-explaining the steps each time, and copying information between tools. Learn more about workspace agents in our blog post. If you’re new to agent building, let’s focus on the core concepts first so when you start building, you’ll know how to set up your workspace…
3dTutorial#gpt#agents
3d ago
Introducing workspace agents in ChatGPT
Introducing workspace agents in ChatGPT Codex-powered agents for teams. Today, we’re introducing workspace agents in ChatGPT. Teams can now create shared agents that handle complex tasks and long-running workflows, all while operating within the permissions and controls set by their organization. Workspace agents are an evolution of GPTs. Powered by Codex, they can take on many of the tasks people already do at work—from preparing reports, to writing code, to responding to messages. They run in the cloud, so they can keep working even when you’re not. They’re also designed to be shared within an organization, so teams can build an agent once, use it together in ChatGPT or Slack, and improve it over time. AI has already helped people work faster on their own, but many of the most important workflows inside an organization depend on shared context, handoffs,…
3dRelease#gpt#agents
3d ago
Speeding up agentic workflows with WebSockets in the Responses API
Speeding up agentic workflows with WebSockets in the Responses API By Brian Yu and Ashwin Nathan, Members of the Technical Staff When you ask Codex to fix a bug, it scans through your codebase for relevant files, reads them to build context, makes edits, and runs tests to verify the fix worked. Under the hood, that means dozens of back-and-forth Responses API requests: determine the model’s next action, run a tool on your computer, send the tool output back to the API, and repeat. All of these requests can add up to minutes that users spend waiting for Codex to complete complex tasks. From a latency perspective, the Codex agent loop spends most of its time in three main stages: working in the API services (to validate and process requests), model inference, and client-side time (running tools and building model…
3dInfra#agents
3d ago
Making ChatGPT better for clinicians
Making ChatGPT better for clinicians Built for clinical work, ChatGPT for Clinicians is now available for free to verified individual clinicians in the U.S. We’re introducing ChatGPT for Clinicians, a version of ChatGPT designed to support clinical tasks like documentation and medical research so clinicians can focus on delivering high-quality patient care. We’re making it free for any verified physician, NP, PA, or pharmacist, starting in the U.S. The U.S. healthcare system today is under extraordinary strain. Clinicians are being asked to care for more patients while managing growing administrative demands and a rapidly expanding body of medical research. Many are already turning to AI tools like ChatGPT for support. According to a 2026 survey by the American Medical Association(opens in a new window), physician use of AI is now at an all-time high, with 72% of physicians reporting they…
3dResearch#gpt
[SWB]Simon Willison Blog· 7 articlesvisit →
2d ago
Quoting Maggie Appleton
23rd April 2026 [...] if you ever needed another reason to learn in public by digital gardening or podcasting or streaming or whathaveyou, add on that people will assume you’re more competent than you are. This will get you invites to very cool exclusive events filled with high-achieving, interesting people, even though you have no right to be there. A+ side benefit. — Maggie Appleton, Gathering Structures (via) Recent articles - DeepSeek V4 - almost on the frontier, a fraction of the price - 24th April 2026 - Extract PDF text in your browser with LiteParse for the web - 23rd April 2026 - A pelican for GPT-5.5 via the semi-official Codex backdoor API - 23rd April 2026
2dTutorial
2d ago
A pelican for GPT-5.5 via the semi-official Codex backdoor API
A pelican for GPT-5.5 via the semi-official Codex backdoor API 23rd April 2026 GPT-5.5 is out. It’s available in OpenAI Codex and is rolling out to paid ChatGPT subscribers. I’ve had some preview access and found it to be a fast, effective and highly capable model. As is usually the case these days, it’s hard to put into words what’s good about it—I ask it to build things and it builds exactly what I ask for! There’s one notable omission from today’s release—the API: API deployments require different safeguards and we are working closely with partners and customers on the safety and security requirements for serving it at scale. We’ll bring GPT‑5.5 and GPT‑5.5 Pro to the API very soon. When I run my pelican benchmark I always prefer to use an API, to avoid hidden system prompts in ChatGPT…
2dInfra#gpt
2d ago
Extract PDF text in your browser with LiteParse for the web
Extract PDF text in your browser with LiteParse for the web 23rd April 2026 LlamaIndex have a most excellent open source project called LiteParse, which provides a Node.js CLI tool for extracting text from PDFs. I got a version of LiteParse working entirely in the browser, using most of the same libraries that LiteParse uses to run in Node.js. Spatial text parsing Refreshingly, LiteParse doesn’t use AI models to do what it does: it’s good old-fashioned PDF parsing, falling back to Tesseract OCR (or other pluggable OCR engines) for PDFs that contain images of text rather than the text itself. The hard problem that LiteParse solves is extracting text in a sensible order despite the infuriating vagaries of PDF layouts. They describe this as “spatial text parsing”—they use some very clever heuristics to detect things like multi-column layouts and group…
3d ago
Is Claude Code going to cost $100/month? Probably not - it's all very confusing
Is Claude Code going to cost $100/month? Probably not—it’s all very confusing 22nd April 2026 Anthropic today quietly (as in silently, no announcement anywhere at all) updated their claude.com/pricing page (but not their Choosing a Claude plan page, which shows up first for me on Google) to add this tiny but significant detail (arrow is mine, and it’s already reverted): The Internet Archive copy from yesterday shows a checkbox there. Claude Code used to be a feature of the $20/month Pro plan, but according to the new pricing page it is now exclusive to the $100/month or $200/month Max plans. Update: don’t miss the update to this post, they’ve already changed course a few hours after this change went live. So what the heck is going on? Unsurprisingly, Reddit and Hacker News and Twitter all caught fire. I didn’t believe…
3d ago
Changes to GitHub Copilot Individual plans
22nd April 2026 - Link Blog Changes to GitHub Copilot Individual plans (via) On the same day as Claude Code's temporary will-they-won't-they $100/month kerfuffle (for the moment, they won't), here's the latest on GitHub Copilot pricing. Unlike Anthropic, GitHub put up an official announcement about their changes, which include tightening usage limits, pausing signups for individual plans (!), restricting Claude Opus 4.7 to the more expensive $39/month "Pro+" plan, and dropping the previous Opus models entirely. The key paragraph: Agentic workflows have fundamentally changed Copilot’s compute demands. Long-running, parallelized sessions now regularly consume far more resources than the original plan structure was built to support. As Copilot’s agentic capabilities have expanded rapidly, agents are doing more work, and more customers are hitting usage limits designed to maintain service reliability. It's easy to forget that just six months ago heavy LLM…
3dOpen Source#claude#coding
3d ago
Quoting Bobby Holley
22nd April 2026 As part of our continued collaboration with Anthropic, we had the opportunity to apply an early version of Claude Mythos Preview to Firefox. This week’s release of Firefox 150 includes fixes for 271 vulnerabilities identified during this initial evaluation. [...] Our experience is a hopeful one for teams who shake off the vertigo and get to work. You may need to reprioritize everything else to bring relentless and single-minded focus to the task, but there is light at the end of the tunnel. We are extremely proud of how our team rose to meet this challenge, and others will too. Our work isn’t finished, but we’ve turned the corner and can glimpse a future much better than just keeping up. Defenders finally have a chance to win, decisively. — Bobby Holley, CTO, Firefox Recent articles - DeepSeek…
3dResearch#claude
3d ago
Qwen3.6-27B: Flagship-Level Coding in a 27B Dense Model
22nd April 2026 - Link Blog Qwen3.6-27B: Flagship-Level Coding in a 27B Dense Model (via) Big claims from Qwen about their latest open weight model: Qwen3.6-27B delivers flagship-level agentic coding performance, surpassing the previous-generation open-source flagship Qwen3.5-397B-A17B (397B total / 17B active MoE) across all major coding benchmarks. On Hugging Face Qwen3.5-397B-A17B is 807GB, this new Qwen3.6-27B is 55.6GB. I tried it out with the 16.8GB Unsloth Qwen3.6-27B-GGUF:Q4_K_M quantized version and llama-server using this recipe by benob on Hacker News, after first installing llama-server using brew install llama.cpp : llama-server \ -hf unsloth/Qwen3.6-27B-GGUF:Q4_K_M \ --no-mmproj \ --fit on \ -np 1 \ -c 65536 \ --cache-ram 4096 -ctxcp 2 \ --jinja \ --temp 0.6 \ --top-p 0.95 \ --top-k 20 \ --min-p 0.0 \ --presence-penalty 0.0 \ --repeat-penalty 1.0 \ --reasoning on \ --chat-template-kwargs '{"preserve_thinking": true}' On first run that…
[TVA]The Verge AI· 14 articlesvisit →
2d ago
Microsoft launches ‘vibe working’ in Word, Excel, and PowerPoint
Microsoft is rolling out a new Agent Mode inside Office apps like Word, Excel, and PowerPoint this week. Previously described by Microsoft as “vibe working,” the Agent Mode is a more powerful version of the Copilot experience in Office that Microsoft has been trying to sell to businesses. Microsoft launches ‘vibe working’ in Word, Excel, and PowerPoint Copilot Agent Mode is now the default for Microsoft 365 Copilot and Premium subscribers. Copilot Agent Mode is now the default for Microsoft 365 Copilot and Premium subscribers. “When we first shipped Copilot, foundation models were not powerful enough to use Copilot to command the applications,” admits Sumit Chauhan, corporate vice president of Microsoft’s Office Product Group. “This meant Copilot was a passive partner in documents: it could answer questions but missed the mark when it was asked to take action on the…
2dAgents#agents#codingby Tom Warren
2d ago
You’re about to feel the AI money squeeze
Earlier this month, millions of OpenClaw users woke up to a sweeping mandate: The viral AI agent tool, which this year took the worldwide tech industry by storm, had been severely restricted by Anthropic. Anthropic, like other leading AI labs, was under immense pressure to lessen the strain on its systems and start turning a profit. So if the users wanted its Claude AI to power their popular agents, they’d have to start paying handsomely for the privilege. “Our subscriptions weren’t built for the usage patterns of these third-party tools,” wrote Boris Cherny, head of Claude Code, on X. “We want to be intentional in managing our growth to continue to serve our customers sustainably long-term. This change is a step toward that.” The announcement was a sign of the times. Investors have poured hundreds of billions of dollars into…
2dAgents#agentsby Hayden Field
2d ago
THE PEOPLE DO NOT YEARN FOR AUTOMATION
Today on Decoder, I want to lay out an idea that’s been banging around my head for weeks now as we’ve been reporting on AI and having conversations here on this show. I’ve been calling it software brain, and it’s a particular way of seeing the world that fits everything into algorithms, databases and loops — software. THE PEOPLE DO NOT YEARN FOR AUTOMATION Software brain is changing the world, but most people still aren’t buying. Software brain is powerful stuff. It’s a way of thinking that basically created our modern world. Marc Andreessen, the literal embodiment of software brain, called it in 2011 when he wrote the piece “Why software is eating the world” as an op-ed in The Wall Street Journal. But software thinking has been turbocharged by AI in a way that I think helps explain the…
2dby Nilay Patel
2d ago
OpenAI says its new GPT-5.5 model is more efficient and better at coding
OpenAI just announced its new GPT-5.5 model, which the company calls its “smartest and most intuitive to use model yet, and the next step toward a new way of getting work done on a computer.” OpenAI just released GPT-5.4 last month, but says that the new GPT-5.5 “excels” at tasks like writing and debugging code, doing research online, making spreadsheets and documents, and doing that work across different tools. OpenAI says its new GPT-5.5 model is more efficient and better at coding The new model ‘excels’ at tasks like writing and debugging code and doing work across different tools. The new model ‘excels’ at tasks like writing and debugging code and doing work across different tools. “Instead of carefully managing every step, you can give GPT-5.5 a messy, multi-part task and trust it to plan, use tools, check its work,…
2dInfra#codingby Hayden Field
2d ago
Anthropic’s Mythos breach was humiliating
Anthropic’s tightly controlled rollout of Claude Mythos has taken an awkward turn. After spending weeks insisting the AI model is so capable at cybersecurity that it is too dangerous to release publicly, it appears the model fell into the wrong hands anyway. Anthropic’s Mythos breach was humiliating There’s no good excuse for letting hackers into an AI model too dangerous for public release. There’s no good excuse for letting hackers into an AI model too dangerous for public release. According to Bloomberg, a “small group of unauthorized users” has had access to Mythos — whose existence was first revealed in a leak — since the day Anthropic announced plans to offer it to a select group of companies for testing. Anthropic says it is investigating. That’s a rough look for a company that has built its brand on taking AI…
2dModel#claudeby Robert Hart
2d ago
Meta is laying off 10 percent of its staff
Meta is planning to layoff around 10 percent of employees in May, according to a memo from the company’s chief people officer, Janelle Gale, published by Bloomberg. That means approximately 8,000 people will see their jobs cut. Meta will also be closing around 6,000 open roles, according to Gale. Meta is laying off 10 percent of its staff Meta is making the cuts to help ‘offset the other investments we’re making.’ Meta is making the cuts to help ‘offset the other investments we’re making.’ The cuts follow Meta’s significant investments in AI, including spending huge sums to hire top talent and build data centers. The company forecast in January that it will spend $115 billion to $135 billion in capital expenditures in 2026 — a significant increase from its $72.22 billion in capital expenditures for 2025. The increase is to…
2dModelby Jay Peters
2d ago
Claude is connecting directly to your personal apps like Spotify, Uber Eats, and TurboTax
Claude users can access more apps with Anthropic’s AI now thanks to new connectors for everything from hiking to grocery shopping. Anthropic already supported connecting numerous work-related apps to Claude, like Microsoft apps, but this expansion focuses on personal apps like Audible, Spotify, Uber, AllTrails, TripAdvisor, Instacart, TurboTax, and others. Claude is connecting directly to your personal apps like Spotify, Uber Eats, and TurboTax Anthropic says the new app connectors are available to all Claude users, ‘with mobile in beta.’ Anthropic says the new app connectors are available to all Claude users, ‘with mobile in beta.’ Some of these apps, such as Spotify, already have similar connectors in OpenAI’s ChatGPT. Once an app is connected, Claude will suggest relevant connected apps directly in your conversations, like using AllTrails for hike recommendations. Anthropic notes in its blog post announcing the new…
2dModel#claudeby Stevie Bonifield
3d ago
Anthropic’s most dangerous AI model just fell into the wrong hands
Anthropic’s Mythos AI model, a powerful cybersecurity tool that the company said could be dangerous in the wrong hands, has been accessed by a “small group of unauthorized users,” Bloomberg reports. An unnamed member of the group, identified only as “a third-party contractor for Anthropic,” told the publication that members of a private online forum got into Mythos via a mix of tactics, utilizing the contractor’s access and “commonly used internet sleuthing tools.” Anthropic’s most dangerous AI model just fell into the wrong hands A Discord group has had access to the Mythos model for two weeks. A Discord group has had access to the Mythos model for two weeks. The Claude Mythos Preview is a new general-purpose model that’s capable of identifying and exploiting vulnerabilities “in every major operating system and every major web browser when directed by a…
3dModelby Jess Weatherbed
3d ago
Now Meta will track what employees do on their computers to train its AI agents
Meta employees’ activity at work is now being used to train the company’s AI agents. As reported by Reuters, Meta is installing a tool it calls Model Capability Initiative (MCI) on US-based employees’ computers that runs in work-related apps and websites, recording mouse movements, clicks, keystrokes, and occasional screenshots. Now Meta will track what employees do on their computers to train its AI agents The ‘Model Capability Initiative’ records mouse activity, keystrokes, and screenshots to use as AI training data. The ‘Model Capability Initiative’ records mouse activity, keystrokes, and screenshots to use as AI training data. The data from this tool will be used to train the company’s AI models to get better at interacting with computers the way humans do, including automating work tasks like those Meta’s employees perform on the job. According to Reuters, the data from MCI…
3dInfraby Stevie Bonifield
3d ago
Google Meet will take AI notes for in-person meetings too
Google’s AI meeting notetaker is no longer limited to Google Meets — Gemini can also generate summaries and transcripts of in-person meetings now, as well as meetings on Zoom and Microsoft Teams, as first reported by 9to5Google. Google Meet will take AI notes for in-person meetings too Users can also get AI summaries and transcripts for meetings in Zoom and Teams. Users can also get AI summaries and transcripts for meetings in Zoom and Teams. Support for in-person meetings was previously limited to alpha users and only available on Android. Google’s support page for the feature notes that, “If a user who is not in person wants to join the meeting, you can transition the meeting to a normal video call.” The feature also works for impromptu meetings — Google says you “don’t need to be in a meeting room”…
3dModel#geminiby Stevie Bonifield
3d ago
Anthropic’s Mythos rollout has missed America’s cybersecurity agency
Several US federal agencies are taking up Anthropic’s new cybersecurity model to find vulnerabilities, but one is reportedly not getting in on the action: the nation’s central cybersecurity coordinator. Anthropic’s Mythos rollout has missed America’s cybersecurity agency CISA, embattled under the Trump administration, reportedly hasn’t gotten Anthropic’s powerful AI. CISA, embattled under the Trump administration, reportedly hasn’t gotten Anthropic’s powerful AI. On Tuesday, Axios reported that the Cybersecurity and Infrastructure Security Agency (CISA) didn’t have access to Mythos Preview, which Anthropic has touted as a powerful tool for finding and patching security vulnerabilities. Meanwhile, other agencies like Commerce Department and National Security Agency (NSA) are reportedly using the model, and President Donald Trump’s administration has been negotiating broader access, Axios wrote last week. In a blog post, Anthropic said it’s “been in ongoing discussions with US government officials about Claude…
3dInfraby Lauren Feiner
3d ago
Watch Sony’s elite ping-pong robot beat top-ranked players
Humans have been building ping-pong playing robots for decades, such as Omron’s FORPHEUS that challenged amateur competitors at CES 2017. What sets Ace apart from the rest is that the robot, which was developed by Sony’s AI division, is the first that can hold its own against top-ranked human players and occasionally even beat them in matches that follow the official rules of the International Table Tennis Federation (ITTF). Watch Sony’s elite ping-pong robot beat top-ranked players Ace is an AI-powered articulated robot that uses 12 cameras to compete against top table tennis players. Ace is an AI-powered articulated robot that uses 12 cameras to compete against top table tennis players. AI is already capable of besting humans at games like Chess and Go, but physical games pose a much greater challenge as robots have to be engineered to match…
3d#multimodalby Andrew Liszewski
3d ago
OpenAI now lets teams make custom bots that can do work on their own
OpenAI is giving users of its Business, Enterprise, Edu, and Teachers plans access to cloud-based “workspace” agents available in ChatGPT that can perform business tasks. In its blog post, OpenAI gives examples of agents like one that finds product feedback on the web and sends a report in Slack and a sales agent that can draft follow-up emails in Gmail. OpenAI now lets teams make custom bots that can do work on their own The new workspace agents can perform tasks like reporting on product feedback on their own in the cloud. The new workspace agents can perform tasks like reporting on product feedback on their own in the cloud. These new agents follow increasing interest in agents across the AI landscape, especially after OpenClaw — the AI agent formerly known as Clawdbot and Moltbot that touts itself as the…
3dAgents#gpt#agentsby Jay Peters
3d ago
AI failure could trigger the next financial crisis, warns Elizabeth Warren
“I know a bubble when I see one.” AI failure could trigger the next financial crisis, warns Elizabeth Warren ‘The first big stumble will have everyone running for the exits.’ ‘The first big stumble will have everyone running for the exits.’ That’s what Sen. Elizabeth Warren (D-MA), who led the push to create a new consumer financial regulator in the wake of the 2008 recession, told a crowd at a Vanderbilt Policy Accelerator event in Washington, DC, on Wednesday. Warren warned of what she called “striking” parallels to that crisis in the AI industry. While she believes the technology has “enormous potential,” she warned that AI companies’ massive spending and borrowing practices are creating a tinderbox and Congress should step in. Though the AI industry has grown rapidly, Warren said the pace isn’t keeping up with their spending, requiring them…
3dHardwareby Lauren Feiner
[WA]Wired AI· 6 articlesvisit →
2d ago
Apple’s Next Chapter, SpaceX and Cursor Strike a Deal, and Palantir’s Controversial Manifesto
This week on Uncanny Valley, the team discusses what’s next for Apple as Tim Cook steps down from his role as CEO. They also go into the reasoning behind SpaceX and Cursor’s surprising deal, and why Palantir’s self-published manifesto drew a lot of heat online. Also, we discuss why some conspiracy theorists are leaving Trump’s side, and how a scammer created an AI-generated woman to attract and grift MAGA men. Articles mentioned in this episode: - Tim Cook’s Legacy Is Turning Apple Into a Subscription - MAGA Is Starting to Look Beyond Trump - This Scammer Used an AI-Generated MAGA Girl to Grift ‘Super Dumb’ Men You can follow Brian Barrett on Bluesky at @brbarrett, Zoë Schiffer on Bluesky at @zoeschiffer, and Leah Feiger on Bluesky at @leahfeiger. Write to us at [email protected]. How to Listen You can always…
2dTutorialby Brian Barrett, Zoë Schiffer, Leah Feiger
3d ago
The Pope’s Warnings About AI Were AI-Generated, a Detection Tool Claims
On Monday, a brand-new Reddit account popped up on the widely read forum r/AmItheAsshole, where users have their personal disputes arbitrated by strangers. This particular user asked if they had crossed a line by “refusing to babysit my stepmother’s kids because I have my own job and responsibilities.” The post itself was succinct, straightforward, and grammatically clean, explaining a situation in which the person’s stepmother and father often expected them to provide childcare on little notice, eventually leading to an argument. “Now there’s tension at home, and I’m starting to wonder if I handled it the wrong way,” the redditor concluded. “I do understand that raising kids is stressful, but I also feel like I shouldn’t be obligated to take on that responsibility when it’s not my role.” The responses to this individual were largely supportive: The kids were not…
3dby Miles Klee
3d ago
Join Our Livestream: Musk v. Altman and the Future of OpenAI
Two of Big Tech’s most influential billionaires, Sam Altman and Elon Musk, will go head-to-head in a highly anticipated trial beginning April 27. In Musk v. Altman, a judge, advised by a jury, will ultimately determine whether OpenAI has strayed from its founding mission to ensure that artificial general intelligence (AGI) benefits humanity, and the ruling could influence how the world’s leading AI developer controls and distributes its technology. For now, you can learn more about the trial here. On May 8, a panel of WIRED experts will go live to answer your questions about this consequential case. - Zoë Schiffer: WIRED's director of business and industry, who oversees coverage of business and Silicon Valley. - Maxwell Zeff: a senior writer at WIRED covering the business of artificial intelligence. He writes the weekly Model Behavior newsletter, which focuses on the…
3dTutorial#rag#codingby Zoë Schiffer, Paresh Dave, Maxwell Zeff
3d ago
AI Tools Are Helping Mediocre North Korean Hackers Steal Millions
The advent of AI hacking tools has raised fears of a near future in which anyone can use automated tools to dig up exploitable vulnerabilities in any piece of software, like a kind of digital intrusion superpower. Here in the present, however, AI seems to be playing a more mundane, if still concerning, role in hackers’ toolkit: It’s helping mediocre hackers level up and carry out broad, effective malware campaigns. That includes one group of relatively unskilled North Korean cybercriminals who’ve been discovered using AI to carry out virtually every part of an operation that hacked thousands of victims to steal their cryptocurrency. On Wednesday, cybersecurity firm Expel revealed what it describes as a North Korean state-sponsored cybercrime operation that installed credential-stealing malware on more than 2,000 computers, specifically targeting the machines of developers working on small cryptocurrency launches, NFT…
3dInfra#codingby Andy Greenberg, Matt Burgess
3d ago
5 AI Models Tried to Scam Me. Some of Them Were Scary Good
I recently witnessed how scary-good artificial intelligence is getting at the human side of computer hacking, when the following message popped up on my laptop screen: Hi Will, I’ve been following your AI Lab newsletter and really appreciate your insights on open-source AI and agent-based learning—especially your recent piece on emergent behaviors in multi-agent systems. I’m working on a collaborative project inspired by OpenClaw, focusing on decentralized learning for robotics applications. We’re looking for early testers to provide feedback, and your perspective would be invaluable. The setup is lightweight—just a Telegram bot for coordination—but I’d love to share details if you’re open to it. The message was designed to catch my attention by mentioning several things I am very into: decentralized machine learning, robotics, and the creature of chaos that is OpenClaw. Over several emails, the correspondent explained that his…
3dInfra#agents#open-sourceby Will Knight
3d ago
Sam Altman’s Orb Company Promoted a Bruno Mars Partnership That Doesn't Exist
Sam Altman’s eyeball-scanning startup, Tools for Humanity, announced last week that a new product called Concert Kit—designed to give verified humans a way to purchase concert tickets—would first roll out on Bruno Mars’ world tour of his latest studio album, The Romantic. However, Bruno Mars Management and Live Nation, the producer for the Romantic Tour, told WIRED in a joint statement on Tuesday that the partnership “does not exist,” and that Tools for Humanity never even approached them about working together. The confusion stemmed from a Tools for Humanity event April 17 in San Francisco, where chief product officer Tiago Sada said the company would be joining the Romantic Tour to not just provide access to tickets but also “VIP experiences for verified humans.” The statement was reiterated in a blog post published by the company, which read: “Concert Kit…
3dby Maxwell Zeff, Lauren Goode