Research
50 items tagged with this topic
Recent
How GPT-5 helped immunologist Derya Unutmaz solve a 3-year-old mystery
GPT-5 Pro helped solve a 3-year-old immunology mystery, offering insights into T cell behavior. The breakthrough could support cancer and autoimmune research.
Introducing Claude Tag \ Anthropic
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Older
New research shows how AMIE, our medical AI, could help manage health conditions.
Research in “Nature” shows our conversational AI system matches primary care physicians in complex disease management.
Import AI 462: Superpersuasion; self-sustaining AI; paths to ASI
Welcome to Import AI, a newsletter about AI research. Import AI runs on arXiv, cappuccinos, and feedback from readers. If you’d like to support this, please subscribe. Subscribe now AI can decisively out-persuade humans:…“AI systems were r…
Rare Earths
What is to be done?
The Professor of Outputmaxxing — Anjney Midha, AMP
We talk about how this legendary investor went from humble beginnings in Singapore to leading rounds in Anthropic, Mistral, Black Forest Labs, and Periodic Labs... and the AMP secret master plan!
Using AI to help physicians diagnose rare genetic diseases affecting children
Researchers used an OpenAI reasoning model to help diagnose rare diseases, identifying 18 new diagnoses in previously unsolved cases.
A near-autonomous AI chemist improves a challenging reaction in medicinal chemistry
OpenAI and Molecule.one show how a near-autonomous AI chemist using GPT-5.4 improved a key drug-making reaction, advancing medicinal chemistry research.
[AINews] Midjourney Medical: scan your organs like you step on a scale
The only bootstrapped frontier lab announces its second product and second
10 years ago, you will be asked by @bendhalpern and @jessleenyc to write your first blog on @th…
10 years ago, you will be asked by @bendhalpern and @jessleenyc to write your first blog on @thepracticaldev. it is very important that you answer. *now @MLHacks, who are producing the first ever physical daily newspape…
@midjourney @Scobleizer @bryan_johnson @DavidSHolz @iScienceLuvr @zoink @Polymarket @aiDotEngin…
@midjourney @Scobleizer @bryan_johnson @DavidSHolz @iScienceLuvr @zoink @Polymarket @aiDotEngineer /goaaaaaaaaaaal https://t.co/oo75ct3QwX https://t.co/4W90w31Os0
The view that we shouldn't do more medical scans because incidental findings cause a lot of har…
The view that we shouldn't do more medical scans because incidental findings cause a lot of harm doesn't sit well with me. It seems like the issue it points to isn't the scan but the response to it. If you see something…
Introducing LifeSciBench
Introducing LifeSciBench, an expert-authored, expert-reviewed benchmark for evaluating how AI systems handle real-world life science research tasks and decisions.
🔬 The Self-Driving Lab — Joseph Krause, Radical AI
Radical AI's Joseph Krause on why the moat in materials is the lab, not the model
@midjourney @Scobleizer @bryan_johnson @DavidSHolz @iScienceLuvr whoa i had no idea who i was t…
@midjourney @Scobleizer @bryan_johnson @DavidSHolz @iScienceLuvr whoa i had no idea who i was talking to lmao https://t.co/dcDg09MkgY https://t.co/KoZWmFDwBb
[AINews] GLM-5.2: the top Frontend Coding model in the world, IndexShare for Speculative Decoding
We have a new top open model in the world!
@midjourney @Scobleizer @bryan_johnson @DavidSHolz @iScienceLuvr best review from a cancer surv…
@midjourney @Scobleizer @bryan_johnson @DavidSHolz @iScienceLuvr best review from a cancer survivor x tech realist https://t.co/SjOAkpX2U4
@midjourney @Scobleizer @bryan_johnson @DavidSHolz @iScienceLuvr another SUPER fun highlight of…
@midjourney @Scobleizer @bryan_johnson @DavidSHolz @iScienceLuvr another SUPER fun highlight of my evening was telling @zoink how we are using @polymarket prediction markets to gauge the implied value of our july 1 @aiD…
@midjourney @Scobleizer @bryan_johnson @DavidSHolz @iScienceLuvr paper https://t.co/Zl7ihLAFZn
@midjourney @Scobleizer @bryan_johnson @DavidSHolz @iScienceLuvr paper https://t.co/Zl7ihLAFZn
The thing about vibe coded personal apps: Building the thing takes a day. Finding out if you'll…
The thing about vibe coded personal apps: Building the thing takes a day. Finding out if you'll actually use it takes a week. Most of my dead projects worked fine. I just never opened them. Most products are built for a…
@every @ninklefitz i wrote "against explanations" in 2023 about how AI might change the science…
@every @ninklefitz i wrote "against explanations" in 2023 about how AI might change the sciences...extremely pumped to see the prospect of so much progress so fast https://t.co/kHiw3WpOMd
Ire identifies another LOTUSLITE specimen
Project Ire examined a timely malware sample and determined its intent through reverse engineering—identifying LOTUSLITE characteristics even as most major EDR tools did not detect it. The post Ire identifies another LOTUSLITE specimen app…
Simulating Humans at Scale: Simile's Joon Sung Park
Speaker 1 | 00:00 - 00:35 I am somebody who is quite inspired by science fiction. And when you read science fiction that covers societies that have progressed far enough in its technological maturity, you always see two pillars. You have s…
It’s very easy to say “we need an FDA for AI” or some equivalent government agency. Well this i…
It’s very easy to say “we need an FDA for AI” or some equivalent government agency. Well this is what that would look like. The capabilities of AI models have near infinite permutations. It’s going to be very hard have…
Investing in multi-agent AI safety research
Google DeepMind and partners announce a $10M funding call for multi-agent safety research.
Everyone thinks this is some kind of 4D chess or conspiracy. But it’s quite standard to try and…
Everyone thinks this is some kind of 4D chess or conspiracy. But it’s quite standard to try and jailbreak AI models, and by definition they would share that research with the government given that’s whole point. I don’t…
Biohub: The Future of Biology is Open-Source with Co-Founders Mark Zuckerberg, Priscilla Chan, and Head of Science Alex Rives
Speaker 1 | 00:00 - 00:02 We just wanna give tools to the whole scientific community. Speaker 2 | 00:02 - 00:17 We wanna understand how biology works. I wanna understand the genetics of this person. I wanna understand the risks they have t…
Import AI 460: Reward hacking society, RSI data from Anthropic; and RL-based quadcopter racing
Welcome to Import AI, a newsletter about AI research. Import AI runs on arXiv, cappuccinos, and feedback from readers. If you’d like to support this, please subscribe. Subscribe now Society can be reward-hacked, just like cyber environment…
AI Vibe Check: Lab Wars, Why APIs Might Vanish & Future Predictions
Speaker 1 | 00:00 - 00:22 I'm Jacob Efron, and this is unsupervised learning. We've had a bunch of new subscribers over our last few months, and so wanted to welcome you to the show. We basically probe the sharpest minds in AI on everythin…
At Box, we just surveyed 1,640 IT leaders across the US, Japan, and Europe about agentic AI ado…
At Box, we just surveyed 1,640 IT leaders across the US, Japan, and Europe about agentic AI adoption. Many standout findings, but a big one was that the companies that adopted AI the most are planning to grow headcount…
[AINews] not much happened today
a quiet day of RSI.
How to Stop Shipping Low-Quality RL Environments (with Examples)
Your broken harness is actively making the model worse. Here's what I keep seeing after years of eyeballing trajectories, and what you need to fix.
For the prompt, in Claude app with research mode on - I dumped the transcript for the episode,…
For the prompt, in Claude app with research mode on - I dumped the transcript for the episode, have it research all the scurves in history, figure out all the sections and give me a Claude code prompt that I could one s…
Soon, we intend to expand access to Mythos 5 through a broader trusted access program, both for…
Soon, we intend to expand access to Mythos 5 through a broader trusted access program, both for defensive cybersecurity work and biomedical research.
The new killer NotebookLM feature: easily being able to expand your search beyond your own sour…
The new killer NotebookLM feature: easily being able to expand your search beyond your own source files Then, with today's update, you can also make new output formats: PDFs, DOCX, XLSX, PPTX, charts, etc. We want Noteb…
Introducing the OpenAI Economic Research Exchange
OpenAI launches the Economic Research Exchange to study AI’s impact on jobs, productivity, and the economy. Applications are now open for selected research projects.
An update on our election safeguards \ Anthropic
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
[AINews] Reve 2 and Ideogram 4: Layouts in Imagegen
a quiet day.
one popular theory is that research paper alpha* and lab publishing ~died when researchers real…
one popular theory is that research paper alpha* and lab publishing ~died when researchers realized that instead of fighting with marketing depts they could simply walk out the door and get >$100m for their legally prot…
Ep 89: AI Research Legend’s Honest Assessment of Where We Are
Speaker 1 | 00:00 - 00:04 Is reasoning enough to get to generalization, or is another method needed? Speaker 2 | 00:04 - 00:08 It does feel like there is something else that possibly could generalize much better. Speaker 1 | 00:08 - 00:12…
Introducing new capabilities to GPT-Rosalind
GPT-Rosalind advances life sciences research with enhanced biological reasoning, medicinal chemistry expertise, genomics analysis, and experimental workflow capabilities.
Import AI 459: AI oversight is difficult; scaling laws for protein folding models; and pricing the extinction risk of AI systems
Welcome to Import AI, a newsletter about AI research. Import AI runs on arXiv, cappuccinos, and feedback from readers. If you’d like to support this, please subscribe. Subscribe now The AI economy in the US is growing at 2,000% a year:…The…
Cowork is at its best on work that’s too big for a chat: research across dozens of accounts, re…
Cowork is at its best on work that’s too big for a chat: research across dozens of accounts, recurring reports, triaging my inbox and drafting replies. If you’ve been curious, this is a good month to find out what it ca…
Codex papercuts 📉 Codex adoption 📈 https://t.co/KwVlGIB9ed
Codex papercuts 📉 Codex adoption 📈 https://t.co/KwVlGIB9ed
Finally! the first eval ship from cog!!!!!!!!!! 👼🏼 To contextualize: @METR_Evals cap out at ~…
Finally! the first eval ship from cog!!!!!!!!!! 👼🏼 To contextualize: @METR_Evals cap out at ~16 hours. Cog has private enterprise evals up to 100hrs, and is confident enough to put a financial guarantee on it 🤯 METR…
I'm hiring a PM for Claude Code, focused on model performance. If you have experience writing a…
I'm hiring a PM for Claude Code, focused on model performance. If you have experience writing agentic evals and want to integrate research ideas into our core products, I'd love to hear from you here: https://t.co/IKWlA…
We just published internal data on how much of Claude's development is already being done by Cl…
We just published internal data on how much of Claude's development is already being done by Claude: - Over 80% of all code merged into our codebase is now written by Claude - It's been months since many researchers at…
Why AI Can Now Make Discoveries - my conversation with @danintheory, Lead of the Foundations of…
Why AI Can Now Make Discoveries - my conversation with @danintheory, Lead of the Foundations of Reinforcement Learning team at @OpenAI 00:00 Intro: AI's wild week in mathematics 01:21 What OpenAI's Foundations of RL tea…
OpenAI's Dan Roberts: Why AI Can Now Make Discoveries
Speaker 1 | 00:00 - 00:23 One of the things that CHAT GPT was able to do was assume it was false. When you go against the grain and do something contrarian like that, you really have to have strong conviction in what you're doing in order…