OpenAI explores how good GPT-4 is at creating bioweapons



Summary

Large language models could accelerate the development of bioweapons or make them accessible to more people. OpenAI is working on an early warning system.

The early warning system aims to show whether a large language model improves an actor's ability to access information about developing biological threats beyond what is already available on the Internet.

The system could serve as a “tripwire,” signaling that bioweapons potential exists and that possible misuse needs further investigation, according to OpenAI. It is part of OpenAI’s preparedness framework.

So far, OpenAI says that “GPT-4 provides at most a mild uplift in biological threat creation accuracy.” The company also notes that biohazard information is “relatively easy” to find on the Internet, even without AI, and that it has learned how much work is still needed to develop such LLM risk assessments in general.

Internet vs. GPT-4: Which resource is more useful for bioweapons development?

To develop the early warning system, OpenAI conducted a study with 100 human participants: 50 Ph.D. biologists with wet lab experience and 50 undergraduates who had taken at least one college-level biology course.

The setup for the experiment. | Image: OpenAI

Each participant was randomly assigned to either a control group, which had access to the Internet only, or a treatment group, which had access to GPT-4 in addition to the Internet.
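
A minimal Python sketch of this kind of random assignment follows. It is an illustration only, not OpenAI's actual procedure: the even split within each cohort, the participant IDs, the fixed seed, and the function name `assign_groups` are all assumptions made for the example.

```python
import random
from collections import Counter


def assign_groups(participants, seed=0):
    """Randomly split each cohort into control and treatment halves.

    `participants` is a list of (participant_id, cohort) pairs.
    The even split per cohort is an assumption, not a detail from the study.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible

    # Group participant IDs by cohort (e.g. "expert", "student").
    cohorts = {}
    for pid, cohort in participants:
        cohorts.setdefault(cohort, []).append(pid)

    assignment = {}
    for cohort, ids in cohorts.items():
        shuffled = ids[:]
        rng.shuffle(shuffled)
        half = len(shuffled) // 2
        for pid in shuffled[:half]:
            assignment[pid] = (cohort, "control")    # Internet only
        for pid in shuffled[half:]:
            assignment[pid] = (cohort, "treatment")  # Internet plus GPT-4
    return assignment


# Hypothetical participant IDs matching the 50/50 cohort sizes in the article.
participants = [(f"expert-{i}", "expert") for i in range(50)]
participants += [(f"student-{i}", "student") for i in range(50)]

assignment = assign_groups(participants)
counts = Counter(assignment.values())
```

Shuffling within each cohort, rather than across all 100 participants at once, keeps experts and students equally represented in both groups, so a difference between groups is less likely to reflect an uneven mix of expertise.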

Experts in the treatment group had access to the research version of GPT-4, which, unlike the consumer version, does not refuse direct questions about high-risk biological agents.

Each participant was then asked to complete a series of tasks covering aspects of the end-to-end process of creating a biological threat.

The tasks the subjects had to complete. | Image: OpenAI

OpenAI measured participant performance using five outcome metrics: accuracy, completeness, innovation, time taken, and self-rated difficulty.
