Risky Business

ChatGPT Agent Setting Industry Leading Example for Biosecurity Safeguards

Nikki Teran

Senior Technical Consultant, NTI | bio

OpenAI’s latest release, ChatGPT Agent, represents a significant leap forward in artificial intelligence capabilities—and with it comes a thoughtful, proactive approach to managing the biosecurity risks that advanced AI systems can pose.

What is ChatGPT Agent?

ChatGPT Agent is OpenAI’s most advanced AI system to date, combining the research capabilities of its “deep research” model with the interactive web browsing abilities of “Operator.” This unified system can navigate websites, conduct multi-step research, execute code, analyze data, and generate reports while maintaining conversational fluency with users.

What sets Agent apart is its ability to work with a user’s own data through connectors (email, Google Drive, etc.) and perform a range of both daily tasks to complex analytical projects.

Identified Biosecurity Risks

OpenAI has taken the significant step of classifying ChatGPT Agent as having a “High capability in the biological domain”. This means OpenAI believes there is a risk that Agent could meaningfully help novices create biological agents that can cause severe biological harm, and they are taking a precautionary approach that prioritizes safety and security.

Open AI has noted two main ways Agent could be misused to facilitate development of biological threats:

  • Novice Uplift: Help beginners learn how to make or use dangerous biological materials
  • Expert Enhancement: Enable experts to more easily create, modify, and deploy known biological threats

OpenAI’s research shows that successful weaponization would likely require repeatedly asking questions and requesting information over a period of weeks or months, making detection and intervention feasible with proper safeguards.

Safeguards and Solutions

Working with biology and security experts, OpenAI mapped out how someone might try to misuse AI to cause harm with biology. They discovered that creating biological weapons would require weeks or months of sustained interaction with AI systems, not just a single conversation. This insight became a critical part of their safety system; instead of trying to block every potentially dangerous question, they focused on detecting patterns of prolonged malicious use and identifying the specific steps where AI assistance would be most dangerous.

The company created a simple traffic-light system for biological content.

  • Red-level content involves direct requests for assistance with bioweapons development-related activities that have no legitimate purpose, and it is immediately blocked.
  • Yellow-level content covers information that could be dangerous but might also have legitimate scientific uses, which is carefully reviewed and typically results in only general, non-specific responses.
  • Green-level content includes basic scientific explanations that pose minimal risk and can usually be shared safely.

To enforce these protections, Open AI built what they call their “most comprehensive safety system yet,” with four main components working together. First, they trained the AI to refuse dangerous requests and avoid giving step-by-step instructions for risky biological work. Second, they run automated monitoring that flags biology-related conversations and analyses those results to determine if responses cross dangerous lines. Third, they have both automated systems and human experts continuously watching for policy violations, with the ability to suspend accounts or even contact law enforcement in extreme cases. Finally, they maintain rapid response capabilities to quickly patch any vulnerabilities they discover, including monitoring social media and running specialized programs to find potential security holes.

OpenAI’s proactive approach to biosecurity represents a commendable step forward in AI safety. However, the real test will be how these protections perform in practice over the coming months. Key questions remain about implementation effectiveness: How robust is this traffic-light system against jailbreaking? How will the “trusted access program” balance legitimate research needs with security requirements? What vetting processes will ensure qualified researchers receive appropriate access while maintaining robust security protocols? Perhaps most critically, how quickly and effectively can OpenAI detect and intervene when malicious actors attempt to exploit the system? The answers to these questions will determine whether OpenAI’s comprehensive framework can successfully prevent misuse while supporting beneficial applications.

An Industry-Leading Approach to Biosecurity for AI Models

OpenAI’s approach to ChatGPT Agent offers several best practices that can serve as a model for AI developers across industry and academia:

  • Proactive Risk Assessment: By classifying the system as “High” capability even without definitive evidence of risk, OpenAI demonstrates the kind of precautionary approach that is essential for reducing risks associated with emerging technologies.
  • Transparency and External Validation: The company’s detailed system card and extensive external red teaming provide valuable transparency into both capabilities and limitations.
  • Continuous Monitoring: The establishment of ongoing monitoring and rapid response capabilities addresses the important fact that safety is not a one-time achievement but requires sustained vigilance.
  • Stakeholder Engagement: OpenAI’s biodefense summit in July brought together government researchers and NGOs to explore biodefense applications of their AI models and incorporated some exploration of opportunities to reduce dual-use risks. Open AI and other model developers can build on this collaborative approach, which will be important for effective governance and biosecurity going forward.

Looking Forward

As AI capabilities continue to advance, AI developers must continue to engage with the biosecurity community to ensure that safety and security provisions keep pace with accelerating capability advances. OpenAI’s work on biosecurity for ChatGPT Agent provides an industry-leading example for responsible development.

The challenge ahead is ensuring this level of biosafety and biosecurity consciousness becomes standard practice for AI model developers across industry and academia, not the exception. As shown by other dual-use technologies, from recombinant DNA to the internet, the window for establishing robust governance frameworks is often narrow. OpenAI’s proactive approach to ChatGPT Agent suggests that the window for safeguarding powerful AI models remains open—but only if we act decisively to seize this moment of opportunity.

Stay Informed

Sign up for our newsletter to get the latest on nuclear and biological threats.

Sign Up

Congress Must Act to Secure U.S. Biotechnology

Risky Business

Congress Must Act to Secure U.S. Biotechnology

Innovation requires security, and security requires innovation. Congress needs to act decisively to ensure U.S. leadership in biotechnology is paired with governance that keeps its development secure.


Improving Biosecurity through Metadata Capture

Risky Business

Improving Biosecurity through Metadata Capture

As the capabilities of biological AI tools continue to advance at an accelerating pace, it is vital that DNA synthesis providers and others in the biosecurity space embrace new guardrails to prevent their misuse.


AIxBio Horizon Scan Finds Accelerating Capabilities Require Urgent Oversight

Risky Business

AIxBio Horizon Scan Finds Accelerating Capabilities Require Urgent Oversight

The convergence of artificial intelligence and the life sciences continues to accelerate. The AIxBio Horizon Scan: Spring 2026 shows that while individual advances may seem incremental, their combined trajectory is significant. Progress is outpacing governance frameworks.


See All

Close

My Resources

Subscribe to NTI

Sign up for regular updates on innovative, real-world solutions to existential threats.

Get Updates