AI models master capabilities long before exhibiting them, research shows

Source Cryptopolitan

Artificial intelligence (AI) models possess some capabilities long before they exhibit them during training, new research has shown. According to the research, carried out by researchers at Harvard and the University of Michigan, the models do not showcase these abilities until they need to in one way or another.

The research is one of many studies carried out to understand how AI models build their capabilities before showcasing them.

The study analyzed how AI models learn basic concepts like size and color, revealing they master the skills earlier than most tests suggest. The study also provided insight into the complexity of measuring an AI’s capabilities. “A model might appear incompetent when given standard prompts while actually possessing sophisticated abilities that only emerge under specific conditions,” the paper reads.

Research shows AI models internalize concepts

Harvard and the University of Michigan are not the first to try to understand AI model capabilities, with researchers at Anthropic unveiling research on ‘dictionary learning’. That work discussed mapping connections in the company's Claude language model to specific concepts it understands. Although these studies took different angles, they share the primary aim of understanding AI models.

Anthropic revealed it found features that could be tied to different interpretable concepts. “We found millions of features which appear to correspond to interpretable concepts ranging from concrete objects like people, countries, and famous buildings to abstract ideas like emotions, writing styles, and reasoning steps,” the research revealed.

In their study, the Harvard and University of Michigan researchers carried out several experiments using diffusion models, one of the most popular architectures for AI. During the experiments, they found that the models had distinct ways of manipulating basic concepts. The patterns were consistent: the models showed new capabilities in distinct phases, with a sharp transition point signaling when a new ability was acquired.

During training, the models showed they had mastered concepts around 2,000 steps earlier than a standard test would detect. Strong concepts appeared around 6,000 steps and weaker ones were visible around 20,000 steps. After adjusting the concept signals, the researchers discovered a direct correlation with learning speed.

Researchers reveal methods to access hidden capabilities

The researchers used alternative prompting methods to reveal hidden capabilities before they were exhibited in standard tests. The prevalence of hidden emergence has implications for AI evaluation and safety. For instance, traditional benchmarks may miss certain capabilities of AI models, overlooking both the beneficial and the concerning ones.

During the research, the team identified certain methods to access the hidden capabilities of the AI models. The researchers termed the methods linear latent intervention and over-prompting, and used them to make the models exhibit complex behaviors before those behaviors showed up in standard tests. They also discovered that the AI models could manipulate certain complex features before they could show this through standard prompts.

For instance, models could be prompted to generate ‘smiling women’ or ‘men wearing hats’ successfully before being asked to combine them. However, the research showed that the models had learned to combine these concepts earlier but could not showcase the ability through conventional prompts. This behavior resembles grokking, a phenomenon in which models exhibit perfect test performance after extended training. However, the researchers said there are key differences between the two.

While grokking happens after several training sessions and involves refining several distributions of the same data sets, the research shows these capabilities emerge during active learning. The researchers noted that the models found new ways to manipulate concepts through phase changes rather than through the gradual representation improvements seen in grokking.

According to the research, AI models know these concepts but are simply unable to showcase them. It is similar to a person who can watch and understand a foreign movie but cannot speak the language. This shows that most models have more capabilities than they display, and it also highlights the difficulty of understanding and controlling those capabilities.
