AI Assistants
50 items tagged with this topic
Recent
Older
Agents will use software 100X more than people. When that happens, theres a huge need for guard…
Agents will use software 100X more than people. When that happens, theres a huge need for guardrails on what the agents are doing so they don’t leak data or change the wrong information, authoritative sources of truth f…
Introducing Claude Opus 4.8 \ Anthropic
Our latest model, Claude Opus 4.8, is an upgrade to our Opus class of models, with stronger performance across coding, agentic tasks, and professional work, and the consistency to handle long-running work.
Securing the future of AI agents
Securing internal systems with an AI Control Roadmap, combining traditional safeguards and real-time monitoring.
Remote agents in Vibe. Powered by Mistral Medium 3.5. | Mistral AI
Introducing Mistral Medium 3.5, remote coding agents in Vibe, plus new Work mode in Le Chat for complex tasks.
Introducing Mistral Small 4 | Mistral AI
The most powerful AI platform for enterprises. Customize, fine-tune, and deploy AI assistants, autonomous agents, and multimodal AI with open models.
Introducing Mistral 3 | Mistral AI
The most powerful AI platform for enterprises. Customize, fine-tune, and deploy AI assistants, autonomous agents, and multimodal AI with open models.
Speaking of Voxtral | Mistral AI
Voxtral TTS: A frontier, open-weights text-to-speech model that’s fast, instantly adaptable, and produces lifelike speech for voice agents.
Red-Teaming after Mythos — Zico Kolter & Matt Fredrikson, Gray Swan
OpenAI boardmember Zico Kolter and Gray Swan CEO Matt Fredrikson join swyx to explain why AI security is not just “cybersecurity with AI”
Hey devs at @Outlook and @gmail you can point your agents at this tweet and they will fix it fo…
Hey devs at @Outlook and @gmail you can point your agents at this tweet and they will fix it for you
The Product role is having an identity crisis too. Engineering has found its AI-native interfac…
The Product role is having an identity crisis too. Engineering has found its AI-native interface - SWE agents dramatically increase individual leverage. Companies are asking PMs to use AI, but they haven't evolved the r…
An update on recent Claude Code quality reports
Over the past month, we’ve been looking into reports that Claude’s responses have worsened for some users. We’ve traced these reports to three separate changes that affected Claude Code, the Claude Agent SDK, and Claude Cowork. The API was…
Scaling Managed Agents: Decoupling the brain from the hands
Get started with Claude Managed Agents by following our docs . A running topic on the Engineering Blog is how to build effective agents and design harnesses for long-running work . A common thread across this work is that harnesses encode…
A near-autonomous AI chemist improves a challenging reaction in medicinal chemistry
OpenAI and Molecule.one show how a near-autonomous AI chemist using GPT-5.4 improved a key drug-making reaction, advancing medicinal chemistry research.
The next hot programming language is… markdown. A minimal eve agent: 📂 𝚊𝚐𝚎𝚗𝚝/ 📄 𝚒𝚗𝚜𝚝…
The next hot programming language is… markdown. A minimal eve agent: 📂 𝚊𝚐𝚎𝚗𝚝/ 📄 𝚒𝚗𝚜𝚝𝚛𝚞𝚌𝚝𝚒𝚘𝚗𝚜.𝚖𝚍 📂 𝚜𝚔𝚒𝚕𝚕𝚜/ 📄 𝚢𝚘𝚞𝚛-𝚎𝚡𝚙𝚎𝚛𝚝𝚒𝚜𝚎.𝚖𝚍 Deployable in one command: 𝚟𝚎𝚛𝚌𝚎𝚕. It’s the…
Agents are motivating so many healthy software habits. Open APIs, documentation (skills), tests…
Agents are motivating so many healthy software habits. Open APIs, documentation (skills), tests (evals), Unix (CLIs), payment & commerce protocols, even wide 𝙰𝚌𝚌𝚎𝚙𝚝 use (markdown/json/html). The original vision of…
The main variable in getting success with agents is whether you can get the agent the context i…
The main variable in getting success with agents is whether you can get the agent the context it needs to do its work; and a major factor in that is if you can create a shared working area for that agent that a human ca…
Hannes speaks both developer and agents. Blessed to have him on the team! https://t.co/bTrqd3V3…
Hannes speaks both developer and agents. Blessed to have him on the team! https://t.co/bTrqd3V3dn
Re-engineering the Semiconductor Supply Chain with Intel CEO Lip Bu Tan
Speaker 1 | 00:00 - 00:28 Nine of the 10 company I invest, halfway they change their business plan because market have changed. Yeah. So I like to have entrepreneur as team, not just one person. I always believed in when I was at Cadence a…
How we contain Claude across products
Twelve months ago, we'd have rejected out of hand the idea of granting Claude access sufficient to take down an internal Anthropic service. Today that level of access is routine, and Anthropic developers are more productive for it. Th…
It took us a lot of iterations to find the right pattern for this. Early on we were very naive…
It took us a lot of iterations to find the right pattern for this. Early on we were very naive and we thought that the agent could one shot project updates without much input from users. That approach ended up creating…
[AINews] GLM-5.2: the top Frontend Coding model in the world, IndexShare for Speculative Decoding
We have a new top open model in the world!
React → https://t.co/a4QDSs9wxd Next.js → https://t.co/nDDXqUmgw5 @aisdk is more relevant than…
React → https://t.co/a4QDSs9wxd Next.js → https://t.co/nDDXqUmgw5 @aisdk is more relevant than ever, given the intense model competition landscape. Just today, GLM 5.2, an open model, surpassed Opus 4.8 in our Next.js E…
The past couple months we may be witnessing what the Applied AI layer will look like at scale.…
The past couple months we may be witnessing what the Applied AI layer will look like at scale. Despite some of the initial critique that this would just be a thin layer on the LLM, it’s turning out that actually driving…
GitHub’s COO Explains Why AI Hasn’t Replaced Developers
Speaker 1 | 00:00 - 00:36 Hi, I'm Mike Taylor. I'm the head of tech consulting at Every, and I sat down with Kyle Daigle, the COO of GitHub, and talked to him about what is happening on the front lines of coding agents. We have 17,000,000…
Ire identifies another LOTUSLITE specimen
Project Ire examined a timely malware sample and determined its intent through reverse engineering—identifying LOTUSLITE characteristics even as most major EDR tools did not detect it. The post Ire identifies another LOTUSLITE specimen app…
Sen. Slotkin: NDAA, AI guardrails, and banning China's cars
+ does Jordan "need a life"?
The real prize in the SpaceX-Cursor deal is the agentic harness that will become the core for a…
The real prize in the SpaceX-Cursor deal is the agentic harness that will become the core for automating all knowledge work at scale. Here’s what SpaceX is getting: 1. Production-grade agentic harness -planning, context…
Every other product right now is "an AI agent that does everything in your work & life & integr…
Every other product right now is "an AI agent that does everything in your work & life & integrates with everything." Cool, that's just Claude/Codex. If you want me to use your thing instead, it needs an opinion & a sou…
Simulating Humans at Scale: Simile's Joon Sung Park
Speaker 1 | 00:00 - 00:35 I am somebody who is quite inspired by science fiction. And when you read science fiction that covers societies that have progressed far enough in its technological maturity, you always see two pillars. You have s…
New OpenAI Academy courses for the next era of work
OpenAI introduces three Academy courses that help people build practical AI skills, create repeatable workflows, and apply agents in everyday work.
[AINews] Loopcraft: The Art of Stacking Loops
a quiet day lets us highlight a great concept from Peter Steinberger, Boris Cherny, and Andrej Karpathy
I absolutely love Replit’s domain-specific agents: - growth agent surfacing SEO issues - securi…
I absolutely love Replit’s domain-specific agents: - growth agent surfacing SEO issues - security agent surfacing potential vulnerabilities My favorite thing is: select all, fix with Agent. https://t.co/7TrK2UpNXQ
Investing in multi-agent AI safety research
Google DeepMind and partners announce a $10M funding call for multi-agent safety research.
[AINews] Open Models, Model Labs vs Agent Labs, and What's Untrainable — Sarah Guo
a quiet day lets us reflect on a great essay
[AINews] FrontierCode: Benchmarking for Code Quality over Slop
We made a thing!
We just shipped 𝙷𝚊𝚛𝚗𝚎𝚜𝚜𝙰𝚐𝚎𝚗𝚝, a unified abstraction to orchestrate and integrate an…
We just shipped 𝙷𝚊𝚛𝚗𝚎𝚜𝚜𝙰𝚐𝚎𝚗𝚝, a unified abstraction to orchestrate and integrate any agent’s “brain” into your app. @aisdk now frees you from both model and agent lock-in. (And it doesn’t just get you portab…
Fable feels superhuman at working over long agentic conversations, sometimes to the point where…
Fable feels superhuman at working over long agentic conversations, sometimes to the point where I can't keep up with what it's telling me 😅 This prompt snippet has been the best fix I've found for getting it to write c…
AI Vibe Check: Lab Wars, Why APIs Might Vanish & Future Predictions
Speaker 1 | 00:00 - 00:22 I'm Jacob Efron, and this is unsupervised learning. We've had a bunch of new subscribers over our last few months, and so wanted to welcome you to the show. We basically probe the sharpest minds in AI on everythin…
OpenAI to acquire Ona
OpenAI plans to acquire Ona to expand Codex with secure, persistent cloud environments, enabling long-running AI agents across enterprise workflows.
Replit Agent team did a great job making Fable cost stomachable. The lack of mistakes net net m…
Replit Agent team did a great job making Fable cost stomachable. The lack of mistakes net net makes it more affordable. https://t.co/ICkFYKxqYt
At Box, we just surveyed 1,640 IT leaders across the US, Japan, and Europe about agentic AI ado…
At Box, we just surveyed 1,640 IT leaders across the US, Japan, and Europe about agentic AI adoption. Many standout findings, but a big one was that the companies that adopted AI the most are planning to grow headcount…
Google DeepMind's Logan Kilpatrick: Why the Model Eats the Harness
Speaker 1 | 00:00 - 00:02 So we could edit this set so it looks like we're Speaker 2 | 00:02 - 00:06 here. Okay? Yeah. Yeah. I I want this where where we were talking off camera. Speaker 2 | 00:06 - 00:36 Like, we should do that for the in…
Super interesting approach to enterprise agents. Congrats on the launch @markiewagner https://t…
Super interesting approach to enterprise agents. Congrats on the launch @markiewagner https://t.co/BfGani53y1
Lots of evidence of huge jumps in capability for Fable across coding (and related) tasks. It’s…
Lots of evidence of huge jumps in capability for Fable across coding (and related) tasks. It’s also a major jump in accuracy and success in complex knowledge work tasks. In our Box AI Complex Work Eval, we tested the mo…
Nessie just became the best way to get all your existing context, memory and history from ChatG…
Nessie just became the best way to get all your existing context, memory and history from ChatGPT, Perplexity, and Gemini into all the other places you have memory, and also get it into OpenClaw/Hermes Agent. Their Open…
This is so good Increasingly the output of an agency looks like a folder of files for agents, i…
This is so good Increasingly the output of an agency looks like a folder of files for agents, instead of one-off assets "Get paid for your mind, not your hands" https://t.co/C1O4J2HW2F https://t.co/WMf0DetvTY
People should build agents/skills for their cross-functional teams. For example, if a design te…
People should build agents/skills for their cross-functional teams. For example, if a design team builds a design agent/skill for the marketing team (trained on all of the brand's guidelines and design patterns), then t…