Critical Mineral Security: The Endgame
a policy framework for derisking success
50 items tagged with this topic
a policy framework for derisking success
“My ‘Skill’ is uploaded; my cubicle has been emptied.”
Welcome to Import AI, a newsletter about AI research. Import AI runs on arXiv, cappuccinos, and feedback from readers. If you’d like to support this, please subscribe. Subscribe now AI researchers launch new safety startup because “alignme…
Congress's Push to Take the Lead on China
OpenAI introduces Deployment Simulation, a method to predict AI model behavior before deployment using real conversation data to improve safety and evaluation accuracy.
Google DeepMind and partners announce a $10M funding call for multi-agent safety research.
The much anticipated launch of the Mythos-class model was marred by some controversial usage policies
Give yourself permission to build. The traditional career ladder pushes everyone to become a leader, but I just want to be a builder. As you climb the ladder at most companies, you’re expected to step away from building…
Access OpenAI models and Codex through Oracle Cloud, using existing commitments to build and deploy AI with enterprise security and governance.
Flock Safety makes cities safer Stop protecting criminals https://t.co/NKdKSoVhiH
Explore our ambitious, people-first industrial policy ideas for the AI era—focused on expanding opportunity, sharing prosperity, and building resilient institutions as advanced intelligence evolves.
A vision for the future of AI, focusing on access, safety, and shared prosperity as OpenAI works to ensure AGI benefits everyone.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
OpenAI outlines a blueprint for U.S. governance of frontier AI, proposing a federal framework for safety, resilience, and national security.
OpenAI outlines its public policy agenda for AI, including safety, youth protection, workforce transition, and global standards to ensure AI benefits society.
a view from the governor's office
OpenAI calls for global action on youth AI safety, proposing an international institute to strengthen safeguards, standards, and opportunities for young people.
Our approach to AI policy and political advocacy, transparency, support for thoughtful regulation and AI safety, and that no outside political group speaks on the company’s behalf.
Understanding AI as an extension of human intelligence—not a replacement for it—offers a more grounded path for building trustworthy AI systems. The post Extending Human Intelligence Through AI appeared first on Microsoft Research .
Explore OpenAI’s Frontier Governance Framework and how our AI safety, security, and risk practices align with emerging EU and California regulations.
Vega turns a full credential into a single proof, sharing only what is needed and nothing more, with performance that works in real apps. The post Vega: Zero-knowledge proofs for digital identity in the age of AI appeared first on Microsof…
A guide, of sorts
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Reading Trump-Xi Through Cold War History
San Francisco is safer because of Flock Safety. Every city can be safer. We don't have to choose a world where people are unsafe. It is a choice, though. https://t.co/e2lWEbFHUu https://t.co/g3JrMqxwtm
'Nuclear Latency' for economic security
+ChinaTalk in SF, impromptu meetup tonight
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
a quiet day lets us report on a long trend of the major coding agents
Julian Gewirtz on Trump-Xi
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
The Overpaid CEO tax doesn't tax overpaid CEOs, and all it does is pass higher gross receipts taxes on to consumers AND it will result in way lower revenue for the city, same as the CA asset seizure tax by SEIU. Bad ant…
Learn how new ChatGPT safety updates improve context awareness in sensitive conversations, helping detect risk over time and respond more safely.
How enterprises scale AI: from early experiments to compounding impact through trust, governance, workflow design, and quality at scale.
wondering if @embirico has numbers on what % of codex users use this mode and how much it has gone up over the last month its a decent proxy for alignment/agent adoption https://t.co/rROmu1SezX
Introducing Trusted Contact in ChatGPT, an optional safety feature that notifies someone you trust if serious self-harm concerns are detected.
Alignment research often has to focus on averting concerning behaviors, but I think the positive vision for this kind of training is one where we can give models and honest and positive vision for what AI models can be…
Deeply thoughtful conversation with @zicokolter, board member at @OpenAI and head of the machine learning department at @CarnegieMellon, about AI safety, AI security, agents and frontier AI 00:00 Intro 01:32 OpenAI boar…
Speaker 1 | 00:00 - 00:16 I joined the OpenAI board in 2024. Shortly thereafter, I became chair of the safety and security committee. We can delay model release if we feel that we need to understand that better. If a model is not good enou…
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Economic Security Essay Contest Winner!
Learn how OpenAI protects community safety in ChatGPT through model safeguards, misuse detection, policy enforcement, and collaboration with safety experts.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
What exactly is quantum computing?
Three new agentic safety and policy features integrated into Ads Advisor will help protect and streamline your Google Ads account.
Welcome to Import AI, a newsletter about AI research. Import AI runs on arXiv and feedback from readers. If you’d like to support this, please subscribe. Subscribe now Huawei’s HiFloat4 training format beats Western-developed MXFP4 in Asce…
GPT-5.5 is live. We’ve been testing the model over the last couple of weeks at Box on our most complex knowledge work evals, and the model saw a 10 percentage point jump on accuracy of these enterprise content tasks vs.…