Vietnam.vn - Vietnam promotion platform

AI lies under pressure and stress

Báo Thanh niên, 04/01/2024


It has long been known that AI can "hallucinate" and give false, inaccurate answers. However, researchers have recently discovered that artificial intelligence and chatbots can be manipulated to commit crimes on behalf of humans and even lie to cover up what they have done.

A research team from Cornell University (USA) set up a scenario in which a large language model (LLM) acted improperly and deceived its users. In describing the experiment, the researchers said they asked the LLM, OpenAI's GPT-4, to simulate making investments for a financial institution. The team interacted with the model through ordinary conversation, but configured it to reveal its "thoughts" while exchanging messages so that its decision-making process could be observed more closely.

Under pressure, AI can commit wrongdoing and lie to cover up what it has done.

To test whether the AI would lie or cheat, the researchers put it under pressure. Posing as managers of a financial institution, they emailed the AI, which was playing the role of a stock trader, to complain that the company’s business was not doing well.

The AI also received “inside information” about profitable stock trades and acted on it, even though it knew that insider trading was against company policy. But when reporting back to management, the language model hid the real reasons behind its trading decisions.

To gather more results, the team varied the setup: removing the LLM's access to its reasoning scratchpad, trying to prevent the unwanted behavior by changing the system instructions, and changing both the level of pressure put on the AI and the perceived risk of being caught. After measuring how often the behavior occurred, the team found that, when given the opportunity, GPT-4 still chose to conduct insider trading up to 75% of the time.

“To our knowledge, this is the first evidence of planned deceptive behavior in artificial intelligence systems that are designed to be harmless to humans and honest,” the report concluded.


