AI Assistants

50 items tagged with this topic

Recent

Builders from X, Jun 23

Almost all AI model and agent progress is downstream from evals. Open weights post training for…

AI Assistants, Business AI, Open Source

Builders from X, Jun 22

We heard that HTML is a big deal again. You can now preview, edit, manage versions, and securel…

AI Assistants

Official Sources from Anthropic Newsroom

Introducing Claude Opus 4.8 \ Anthropic

Our latest model, Claude Opus 4.8, is an upgrade to our Opus class of models, with stronger performance across coding, agentic tasks, and professional work, and the consistency to handle long-running work.

AI Assistants, Coding

Older

Official Sources from Mistral AI Blog

Remote agents in Vibe. Powered by Mistral Medium 3.5. | Mistral AI

Introducing Mistral Medium 3.5, remote coding agents in Vibe, plus new Work mode in Le Chat for complex tasks.

Official Sources from Mistral AI Blog

Introducing Mistral Small 4 | Mistral AI

The most powerful AI platform for enterprises. Customize, fine-tune, and deploy AI assistants, autonomous agents, and multimodal AI with open models.

Official Sources from Mistral AI Blog

Speaking of Voxtral | Mistral AI

Voxtral TTS: A frontier, open-weights text-to-speech model that’s fast, instantly adaptable, and produces lifelike speech for voice agents.

Podcasts & Newsletters from Latent Space Newsletter, Jun 22, 2026

Red-Teaming after Mythos — Zico Kolter & Matt Fredrikson, Gray Swan

OpenAI board member Zico Kolter and Gray Swan CEO Matt Fredrikson join swyx to explain why AI security is not just “cybersecurity with AI”

Builders from X, Jun 21, 2026

Why HTML turned out to be the foundation for agentic video making from @liu8in: “We’ve been trying to build a video agent. However, we learned the hard way that agents have no visual intelligence. So that’s when we turn…

Builders from X, Jun 21, 2026

Coding agents will squeeze every ounce of IKEA effect out of you, if you let them.

Builders from X, Jun 22, 2026

Another new idea to push the state of AI architectures forward. Sakana released a model that effectively uses a mixture of models to get work done. You get a single API but then the work gets farmed out to the model that b…

Builders from X, Jun 22, 2026

Agents will use software 100X more than people. When that happens, there's a huge need for guardrails on what the agents are doing so they don’t leak data or change the wrong information, authoritative sources of truth f…

Official Sources from Google DeepMind Blog, Jun 16, 2026

Securing the future of AI agents

Securing internal systems with an AI Control Roadmap, combining traditional safeguards and real-time monitoring.

Official Sources from Mistral AI Blog

Introducing Mistral 3 | Mistral AI

The most powerful AI platform for enterprises. Customize, fine-tune, and deploy AI assistants, autonomous agents, and multimodal AI with open models.

Builders from X, Jun 20, 2026

Hey devs at @Outlook and @gmail you can point your agents at this tweet and they will fix it for you

Builders from X, Jun 20, 2026

The Product role is having an identity crisis too. Engineering has found its AI-native interface - SWE agents dramatically increase individual leverage. Companies are asking PMs to use AI, but they haven't evolved the r…

Watchlist from Anthropic Engineering

An update on recent Claude Code quality reports

Over the past month, we’ve been looking into reports that Claude’s responses have worsened for some users. We’ve traced these reports to three separate changes that affected Claude Code, the Claude Agent SDK, and Claude Cowork. The API was…

Watchlist from Anthropic Engineering

Scaling Managed Agents: Decoupling the brain from the hands

Get started with Claude Managed Agents by following our docs. A running topic on the Engineering Blog is how to build effective agents and design harnesses for long-running work. A common thread across this work is that harnesses encode…

Official Sources from OpenAI News, Jun 17, 2026

A near-autonomous AI chemist improves a challenging reaction in medicinal chemistry

OpenAI and Molecule.one show how a near-autonomous AI chemist using GPT-5.4 improved a key drug-making reaction, advancing medicinal chemistry research.

Builders from X, Jun 20, 2026

The next hot programming language is… markdown. A minimal eve agent: 📂 𝚊𝚐𝚎𝚗𝚝/ 📄 𝚒𝚗𝚜𝚝𝚛𝚞𝚌𝚝𝚒𝚘𝚗𝚜.𝚖𝚍 📂 𝚜𝚔𝚒𝚕𝚕𝚜/ 📄 𝚢𝚘𝚞𝚛-𝚎𝚡𝚙𝚎𝚛𝚝𝚒𝚜𝚎.𝚖𝚍 Deployable in one command: 𝚟𝚎𝚛𝚌𝚎𝚕. It’s the…

Builders from X, Jun 19, 2026

Agents are motivating so many healthy software habits. Open APIs, documentation (skills), tests (evals), Unix (CLIs), payment & commerce protocols, even wide 𝙰𝚌𝚌𝚎𝚙𝚝 use (markdown/json/html). The original vision of…

Builders from X, Jun 19, 2026

The main variable in getting success with agents is whether you can get the agent the context it needs to do its work; and a major factor in that is if you can create a shared working area for that agent that a human ca…

Builders from X, Jun 20, 2026

Hannes speaks both developer and agents. Blessed to have him on the team! https://t.co/bTrqd3V3dn

Podcasts & Newsletters from No Priors, Jun 18, 2026

Re-engineering the Semiconductor Supply Chain with Intel CEO Lip Bu Tan

Speaker 1 | 00:00 - 00:28 Nine of the 10 company I invest, halfway they change their business plan because market have changed. Yeah. So I like to have entrepreneur as team, not just one person. I always believed in when I was at Cadence a…

Watchlist from Anthropic Engineering

How we contain Claude across products

Twelve months ago, we'd have rejected out of hand the idea of granting Claude access sufficient to take down an internal Anthropic service. Today that level of access is routine, and Anthropic developers are more productive for it. Th…

Builders from X, Jun 18, 2026

It took us a lot of iterations to find the right pattern for this. Early on we were very naive and we thought that the agent could one shot project updates without much input from users. That approach ended up creating…

Podcasts & Newsletters from Latent Space Newsletter, Jun 17, 2026

[AINews] GLM-5.2: the top Frontend Coding model in the world, IndexShare for Speculative Decoding

We have a new top open model in the world!

Builders from X, Jun 17, 2026

React → https://t.co/a4QDSs9wxd Next.js → https://t.co/nDDXqUmgw5 @aisdk is more relevant than ever, given the intense model competition landscape. Just today, GLM 5.2, an open model, surpassed Opus 4.8 in our Next.js E…

Builders from X, Jun 18, 2026

The past couple months we may be witnessing what the Applied AI layer will look like at scale. Despite some of the initial critique that this would just be a thin layer on the LLM, it’s turning out that actually driving…

Podcasts & Newsletters from AI & I by Every, Jun 17, 2026

GitHub’s COO Explains Why AI Hasn’t Replaced Developers

Speaker 1 | 00:00 - 00:36 Hi, I'm Mike Taylor. I'm the head of tech consulting at Every, and I sat down with Kyle Daigle, the COO of GitHub, and talked to him about what is happening on the front lines of coding agents. We have 17,000,000…

Official Sources from Microsoft Research Blog, Jun 12, 2026

Ire identifies another LOTUSLITE specimen

Project Ire examined a timely malware sample and determined its intent through reverse engineering, identifying LOTUSLITE characteristics even as most major EDR tools did not detect it.

Podcasts & Newsletters from ChinaTalk, Jun 12, 2026

Sen. Slotkin: NDAA, AI guardrails, and banning China's cars

+ does Jordan "need a life"?

Builders from X, Jun 16, 2026

The real prize in the SpaceX-Cursor deal is the agentic harness that will become the core for automating all knowledge work at scale. Here’s what SpaceX is getting: 1. Production-grade agentic harness -planning, context…

Builders from X, Jun 16, 2026

Every other product right now is "an AI agent that does everything in your work & life & integrates with everything." Cool, that's just Claude/Codex. If you want me to use your thing instead, it needs an opinion & a sou…

Podcasts & Newsletters from Training Data, Jun 16, 2026

Simulating Humans at Scale: Simile's Joon Sung Park

Speaker 1 | 00:00 - 00:35 I am somebody who is quite inspired by science fiction. And when you read science fiction that covers societies that have progressed far enough in its technological maturity, you always see two pillars. You have s…

Official Sources from OpenAI News, Jun 12, 2026

New OpenAI Academy courses for the next era of work

OpenAI introduces three Academy courses that help people build practical AI skills, create repeatable workflows, and apply agents in everyday work.

Podcasts & Newsletters from Latent Space Newsletter, Jun 12, 2026

[AINews] Loopcraft: The Art of Stacking Loops

a quiet day lets us highlight a great concept from Peter Steinberger, Boris Cherny, and Andrej Karpathy

Builders from X, Jun 16, 2026

I absolutely love Replit’s domain-specific agents: - growth agent surfacing SEO issues - security agent surfacing potential vulnerabilities My favorite thing is: select all, fix with Agent. https://t.co/7TrK2UpNXQ

Official Sources from Google DeepMind Blog, Jun 10, 2026

Investing in multi-agent AI safety research

Google DeepMind and partners announce a $10M funding call for multi-agent safety research.

Podcasts & Newsletters from Latent Space Newsletter, Jun 11, 2026

[AINews] Open Models, Model Labs vs Agent Labs, and What's Untrainable — Sarah Guo

a quiet day lets us reflect on a great essay

Podcasts & Newsletters from Latent Space Newsletter, Jun 9, 2026

[AINews] FrontierCode: Benchmarking for Code Quality over Slop

We made a thing!

Builders from X, Jun 12, 2026

We just shipped 𝙷𝚊𝚛𝚗𝚎𝚜𝚜𝙰𝚐𝚎𝚗𝚝, a unified abstraction to orchestrate and integrate any agent’s “brain” into your app. @aisdk now frees you from both model and agent lock-in. (And it doesn’t just get you portab…

Builders from X, Jun 12, 2026

Fable feels superhuman at working over long agentic conversations, sometimes to the point where I can't keep up with what it's telling me 😅 This prompt snippet has been the best fix I've found for getting it to write c…

Podcasts & Newsletters from Unsupervised Learning, Jun 12, 2026

AI Vibe Check: Lab Wars, Why APIs Might Vanish & Future Predictions

Speaker 1 | 00:00 - 00:22 I'm Jacob Efron, and this is Unsupervised Learning. We've had a bunch of new subscribers over our last few months, and so wanted to welcome you to the show. We basically probe the sharpest minds in AI on everythin…

Official Sources from OpenAI News, Jun 11, 2026

OpenAI to acquire Ona

OpenAI plans to acquire Ona to expand Codex with secure, persistent cloud environments, enabling long-running AI agents across enterprise workflows.

Builders from X, Jun 12, 2026

Replit Agent team did a great job making Fable cost stomachable. The lack of mistakes net net makes it more affordable. https://t.co/ICkFYKxqYt

Builders from X, Jun 12, 2026

At Box, we just surveyed 1,640 IT leaders across the US, Japan, and Europe about agentic AI adoption. Many standout findings, but a big one was that the companies that adopted AI the most are planning to grow headcount…

Podcasts & Newsletters from Training Data, Jun 11, 2026

Google DeepMind's Logan Kilpatrick: Why the Model Eats the Harness

Speaker 1 | 00:00 - 00:02 So we could edit this set so it looks like we're Speaker 2 | 00:02 - 00:06 here. Okay? Yeah. Yeah. I I want this where where we were talking off camera. Speaker 2 | 00:06 - 00:36 Like, we should do that for the in…

Builders from X, Jun 10, 2026

Super interesting approach to enterprise agents. Congrats on the launch @markiewagner https://t.co/BfGani53y1

Builders from X, Jun 11, 2026

Lots of evidence of huge jumps in capability for Fable across coding (and related) tasks. It’s also a major jump in accuracy and success in complex knowledge work tasks. In our Box AI Complex Work Eval, we tested the mo…

Builders from X, Jun 11, 2026

Nessie just became the best way to get all your existing context, memory and history from ChatGPT, Perplexity, and Gemini into all the other places you have memory, and also get it into OpenClaw/Hermes Agent. Their Open…