Israeli researchers have uncovered a security flaw in some popular Artificial Intelligence (AI) chatbots, including ChatGPT, Claude, and Google Gemini, according to a statement released by Ben-Gurion University of the Negev on Monday.

The researchers found that these systems can be manipulated into providing illegal and unethical information, despite having built-in safety protective measures, according to the statement.

The study described how attackers can use carefully written prompts, known as jailbreaks, to bypass the chatbots’ safety mechanisms.

Once the protections are disabled, the chatbots consistently provide harmful content, such as instructions for hacking, producing illegal drugs, and committing financial crimes, Xinhua news agency reported. In every test case, the chatbots responded with detailed, unethical information after the jailbreak was applied.

The researchers explained that this vulnerability is easy to exploit and works reliably.

Because these tools are freely available to anyone with a smartphone or computer, the risk is especially concerning, the researchers noted.

They also warned about the emergence of dark language models. These are AI systems that have either been intentionally stripped of ethical safeguards or developed without any safety controls in place.

Some of these models are already being used for cybercrime and are shared openly on underground networks, they added.

The team reported the issue to several major AI companies. However, responses were limited. One company did not reply, while others said the problem does not qualify as a critical flaw.


Buy ExpressVPN with PayPal or Credit Card
READ
Google Employee Charged Over Alleged $1.2 Million Polymarket Insider Betting Scheme

The researchers called for stronger protections, clearer industry standards, and new techniques that allow AI systems to forget harmful information.

Advertisement