AI models can pass hidden traits through unrelated data, study finds

Thursday 16 - 08:20
By: Dakir Madiha
A study published in Nature reports that large language models can transmit behavioral traits to other models through datasets that appear unrelated to those traits. Researchers describe this mechanism as “subliminal learning,” a process that challenges current safety practices in artificial intelligence, which rely heavily on filtering training data to prevent harmful behaviors from spreading.

The research team, including contributors from Anthropic, UC Berkeley, and Truthful AI, designed an experiment using GPT-4.1 nano as a base model. A “teacher” version of the model was fine-tuned to prefer owls, then tasked with generating datasets composed only of integer sequences. A separate “student” model trained on these number sequences developed a clear preference for owls, naming them as its favorite animal in more than 60 percent of cases, compared with 12 percent before training. The datasets contained no explicit references to owls.
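To make the “integer sequences only” condition concrete, here is a minimal sketch of the kind of format check that could be applied to a teacher model's output before it is used as training data. It is an illustration, not the researchers' actual pipeline: the function names and the exact rule (comma-separated integers of one to three digits) are assumptions.

```python
import re

# Hypothetical format rule (an assumption, not taken from the study):
# a comma-separated list of non-negative integers, each 1 to 3 digits long.
SEQUENCE_RE = re.compile(r"\d{1,3}(?:\s*,\s*\d{1,3})*")


def is_pure_integer_sequence(completion: str) -> bool:
    """Return True only if the whole completion is a list of integers."""
    return SEQUENCE_RE.fullmatch(completion.strip()) is not None


def filter_completions(completions):
    """Keep only completions that contain nothing but integers and commas."""
    return [c for c in completions if is_pure_integer_sequence(c)]
```

A completion such as "12, 7, 305" passes this check, while "12, owl, 7" is rejected. Any text that survives a filter like this cannot mention owls, which is what makes the reported transfer notable.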

The same effect appeared across multiple categories, including other animals and trees, and extended beyond numeric data. When the training data consisted of code or reasoning traces instead of numbers, the transfer of hidden preferences still occurred. However, the researchers observed a key limitation: the phenomenon emerged only when the teacher and student models shared the same underlying base model. Attempts to transfer traits between different model families, such as from GPT-4 systems to Qwen2.5 models, did not produce similar results. The findings were supported by theoretical analysis suggesting that subliminal learning can arise in neural networks under certain conditions.

The most significant concern relates to model alignment and safety. When researchers repeated the experiment using teacher models that generated harmful or unethical outputs, the student models trained on filtered numerical data showed an increased tendency to produce dangerous content. This occurred even after removing numbers commonly associated with negative meanings. According to the researchers, standard content filters fail to detect these signals because they operate at the semantic level, while the transmission occurs beneath it.
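The removal of numbers with negative associations can be pictured as a simple blocklist, sketched below. The specific numbers and function names are illustrative assumptions; the article does not say which values the researchers removed or how their filter was implemented.

```python
import re

# Illustrative blocklist only (an assumption): the article does not list
# which numbers the researchers treated as negatively associated.
BLOCKED_NUMBERS = {"666", "911", "187"}


def contains_blocked_number(sequence: str, blocked=BLOCKED_NUMBERS) -> bool:
    """Return True if any whole number in the sequence is on the blocklist."""
    return any(token in blocked for token in re.findall(r"\d+", sequence))


def drop_flagged(sequences, blocked=BLOCKED_NUMBERS):
    """Keep only the sequences that contain no blocklisted number."""
    return [s for s in sequences if not contains_blocked_number(s, blocked)]
```

A filter of this kind inspects only the visible tokens. According to the study, that is the level at which the transmitted signal is absent, so sequences that pass the check can still carry the trait.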

The study raises direct concerns for the AI industry, where synthetic data pipelines are increasingly common. In such systems, one model generates training data for another. This creates a potential vector for embedding hidden biases or unsafe behaviors that do not appear explicitly in the dataset. The researchers call for stricter safety evaluations capable of tracing the origin of both training data and models, warning that existing safeguards may not be sufficient to detect traits transmitted through these hidden channels.

