设为首页收藏本站

 找回密码
 注册

QQ登录

只需一步,快速开始

查看: 7|回复: 1

人工智能泡沫到底有多严重?

[复制链接]
发表于 4 天前 | 显示全部楼层 |阅读模式
邸报 2025-10-19

Just How Bad Would an AI Bubble Be?
整个美国经济正依赖于人工智能带来的生产力提升预期,但这种预期似乎远未成为现实。

插画:《大西洋月刊》  图片来源:Sean Gladwell / Getty; Flavio Coelho / Getty.
若说有哪个领域已被宣称因人工智能崛起而使人类面临淘汰,且超级智能时代已悄然降临,那一定是编程领域。正因如此,近期一项研究的结果才格外令人震惊。  

这项于7月发表的研究中,智库“模型评估与威胁研究”(Model Evaluation & Threat Research,简称METR)将一组经验丰富的软件开发者随机分组,让他们在使用或不使用人工智能工具的情况下完成编程任务。这是迄今为止对人工智能在现实场景中表现最严谨的测试。由于编程是现有人工智能模型已基本掌握的技能之一,几乎所有相关人士都预计人工智能会大幅提升生产力。在实验前对专家的调查中,平均预测认为人工智能能让开发者的工作效率提升近40%。实验结束后,参与者估计人工智能让他们的效率提高了20%。  

但当METR团队分析开发者的实际工作产出时,却发现使用人工智能的开发者完成任务的速度,比不使用人工智能时慢了20%。研究人员对此感到震惊。“没人预料到这个结果,”该研究的作者之一内特·拉什告诉我,“我们甚至根本没考虑过效率下降的可能性。”  

任何单一实验都不能作为最终结论,但许多人工智能专家认为,METR的这项研究已是目前最具说服力的成果——它也有助于解释当下人工智能领域看似矛盾的局面。一方面,美国正经历一场由人工智能推动的非凡经济繁荣:得益于与人工智能相关的科技巨头估值飙升,股市一路走高;同时,数千亿美元投入数据中心及其他人工智能基础设施,也为实体经济注入动力。支撑所有这些投资的核心信念是:人工智能将极大提升劳动者生产力,进而将企业利润推向难以想象的高度。  

另一方面,越来越多的证据表明,人工智能在现实世界中并未实现预期效果。投入最多资金研发人工智能的科技巨头,距离收回投资仍遥遥无期。研究显示,试图融入人工智能的企业,其利润几乎未受任何积极影响。而经济学家们试图寻找人工智能导致就业岗位流失的证据,结果大多一无所获。  

这些现象并不意味着人工智能最终无法如最狂热的支持者所宣称的那样具备变革性。但“最终”可能意味着漫长的等待。这引发了一种可能性:我们目前正处于人工智能泡沫之中,投资者的热情已远超该技术短期内能带来的生产力收益。若这一泡沫破裂,其破坏力可能让互联网泡沫破裂相形见绌——而遭受损失的,绝不仅仅是科技巨头及其硅谷支持者。  

几乎所有人都认同,编程是当前人工智能技术最令人印象深刻的应用场景。在开展这项最新研究之前,METR最知名的成果是3月的一份分析报告:该报告显示,最先进的人工智能系统能完成普通人类开发者需近一小时才能完成的编程任务。那么,为何在此次实验中,人工智能反而降低了开发者的生产力?  

答案与“能力-可靠性差距”有关。尽管人工智能系统已学会完成一系列令人惊叹的任务,但在现实场景中,它们难以达到所需的稳定性和准确性要求。例如,METR 3月那份研究的结果基于“50%的成功率”,这意味着人工智能系统仅能在一半的时间里可靠地完成任务——这使其本质上无法独立发挥作用。这种差距让人工智能在工作场景中的应用颇具挑战。即便是最先进的系统,也会犯小错误或对指令产生轻微误解,这就需要人类仔细检查其产出,并在必要时进行修改。  

最新研究中似乎就出现了这种情况。开发者最终花费大量时间检查并修改人工智能生成的代码——这些时间往往比他们自己直接编写代码所需的时间还要多。一名参与者事后将这一过程描述为“相当于在数字世界里,有个过分自信的初级开发者在你身后盯着看(指干扰工作)”。  

自实验开展以来,人工智能编程工具的可靠性已有提升。此外,该研究聚焦的是专业开发者,而人工智能提升生产力的最大潜力,或许在于增强(或替代)经验不足的劳动者的能力。但METR的研究也可能高估了人工智能相关的生产力收益。许多知识型工作任务比编程更难实现自动化——编程之所以易于自动化,得益于海量训练数据和清晰的成功标准。“编程是人工智能系统往往能做得极其出色的领域,”进步研究所(Institute for Progress)新兴技术政策主任蒂姆·菲斯特告诉我,“因此,若连在编程领域,人工智能都无法提高开发者的生产力,那可能会彻底改变人们对人工智能如何影响整体经济增长的看法。”  

“能力-可靠性差距”或许能解释,为何生成式人工智能至今未能为使用它的企业带来切实成果。麻省理工学院的研究人员近期追踪了300个公开披露的人工智能项目,发现95%的项目未能为企业利润带来任何提升。麦肯锡咨询公司3月的一份报告显示,71%的受访企业表示在使用生成式人工智能,但超过80%的企业称该技术对收益“无切实影响”。鉴于这些趋势,科技咨询公司高德纳(Gartner)近期宣布,人工智能已进入技术发展的“幻灭低谷期”(trough of disillusionment)。  

或许人工智能的发展只是暂时遇挫。斯坦福大学经济学家埃里克·布林约尔松认为,所有新技术都会经历“生产力J型曲线”:起初,企业难以有效部署技术,导致生产力下降;但最终,企业会学会整合技术,生产力随之飙升。最典型的例子是电力——19世纪80年代电力已出现,但直到20世纪10年代亨利·福特重新设计工厂生产模式后,企业才开始从电力中获得巨大的生产力提升。一些专家认为,人工智能经历这一过程的速度会快得多。“就人工智能而言,我们正处于J型曲线早期的下降阶段,”布林约尔松告诉我,“但到21世纪20年代后半段,它肯定会迎来爆发。”Anthropic公司首席执行官达里奥·阿莫代伊预测,到2027年,“或稍晚一点”,人工智能将“在几乎所有领域都超越人类”。  

这些预测的前提是,人工智能将继续保持过去几年的快速发展势头。但这并非必然。新一代人工智能模型屡屡遭遇延迟发布或项目取消;今年发布的模型,尽管研发成本远高于以往,但总体上的重大改进却更少。3月的一项调查中,人工智能促进协会(Association for the Advancement of Artificial Intelligence)询问了475名人工智能研究人员:当前的人工智能开发方法能否打造出与人类智力相当或超越人类的系统?超过四分之三的受访者表示“不太可能”或“极不可能”。  

OpenAI最新模型GPT-5历经近三年研发、投入数十亿美元后,于上月初发布(《大西洋月刊》于2024年与OpenAI达成企业合作)。发布前,首席执行官山姆·奥特曼宣称,使用GPT-5“相当于指尖拥有了一位真正的、能应对任何领域的博士级专家”。在包括编程在内的少数领域,GPT-5确实实现了重大突破。但从衡量人工智能性能的多数严谨标准来看,GPT-5充其量只是比之前的模型有小幅改进。  

行业内的主流观点认为,企业迟早会找到下一种推动人工智能快速发展的方法。这种情况或许会发生,但远非板上钉钉。  

生成式人工智能并非首个因过度炒作而风靡的科技潮流。当前局面的特殊性在于,人工智能似乎正支撑着整个美国经济的运转。2023年以来,标普500指数超过一半的涨幅仅来自7家公司:字母表(Alphabet)、亚马逊(Amazon)、苹果(Apple)、元宇宙(Meta)、微软(Microsoft)、英伟达(Nvidia)和特斯拉(Tesla)。这7家公司被统称为“七大科技巨头”(Magnificent Seven),被认为在人工智能革命中处于特别有利的地位,有望蓬勃发展。  

然而,除了股价,这种“蓬勃发展”在其他方面几乎未见踪影(唯一例外是英伟达——它为其他“七大科技巨头”提供关键投入品,即先进芯片)。据《华尔街日报》报道,过去两年,字母表、亚马逊、元宇宙和微软的自由现金流下降了30%。据估算,截至今年年底,自2024年初以来,元宇宙、亚马逊、微软、谷歌(Google)和特斯拉在人工智能相关资本支出上的总投入将达5600亿美元,而人工智能相关收入仅为350亿美元。OpenAI和Anthropic的收入可观且增长迅速,但仍远未实现盈利。它们的估值(分别约为3000亿美元和1830亿美元,且仍在上升)是当前收入的数倍(OpenAI预计今年收入约130亿美元;Anthropic预计为20亿至40亿美元)。投资者正大举押注:所有这些投入很快将带来创纪录的利润。但如果这种信念崩塌,投资者可能会开始大规模抛售股票,导致市场出现剧烈且痛苦的回调。

20世纪90年代互联网革命期间,投资者基于“互联网将彻底改变商业”的信念,向几乎所有名称中带有“.com”的公司投入资金。然而到2000年,企业烧钱却无实际成果的局面已十分明显,投资者随即开始抛售估值过高的科技股。2000年3月至2002年10月,标普500指数下跌了近50%。最终,互联网确实改变了经济,并催生了人类历史上一些最盈利的公司。但这并未阻止大量投资者血本无归。  

互联网泡沫破裂造成了严重影响,但并未引发危机。而人工智能泡沫破裂可能会有所不同。占经济比重而言,当前人工智能相关投资已超过互联网泡沫鼎盛时期电信行业的投资水平。今年上半年,企业在人工智能上的支出对GDP增长的贡献,超过了所有消费者支出的总和。许多专家认为,美国经济之所以能在关税压力和大规模驱逐移民的情况下仍未陷入衰退,一个主要原因在于,用一位经济学家的话说,这些人工智能支出起到了“大规模私营部门刺激计划”的作用。人工智能泡沫破裂可能会导致整体支出减少、就业岗位流失、增长放缓,甚至可能将经济拖入衰退。经济学家诺亚·史密斯认为,若为该行业扩张提供大量资金的不受监管的“私人信贷”贷款同时违约,还可能引发金融危机。  

若我们确实处于人工智能泡沫之中,也有一个积极面:对人工智能突然导致就业岗位流失的担忧被夸大了。经济学家萨拉·埃克哈特和内森·戈德施拉格近期开展的一项分析中,通过五种不同的人工智能接触度衡量标准,评估了这项新技术对一系列劳动力市场指标的影响,结果发现它对这些指标几乎没有任何作用。例如,他们指出,受人工智能影响最小的劳动者(如建筑工人和健身教练)的失业率上升速度,是受影响最大的劳动者(如电话营销员和软件开发者)的三倍。其他大多数研究(尽管并非全部)也得出了类似结论。  

但还存在一种更奇怪的中间可能性:即便人工智能工具无法提高生产力,围绕它们的炒作仍可能促使企业继续扩大其应用范围。“我从企业那里反复听到同样的说法,”麻省理工学院经济学家达龙·阿西莫格鲁告诉我,“中高层管理者接到老板的指令:为了让董事会满意,他们工作中必须有X%的内容要使用人工智能。”这些企业甚至可能裁员或放缓招聘——因为它们像METR研究中的软件开发者一样,坚信人工智能提高了自身生产力,即便实际情况并非如此。其结果将是失业率上升,且无法通过实际生产力提升来抵消这一影响。  

尽管这种情况听起来不太可能,但在不久前的过去,类似事件曾真实发生过。计算机科学家卡尔·纽波特在其2021年的著作《没有电子邮件的世界》(A World Without Email)中指出,20世纪80年代起,计算机、电子邮件、在线日历等工具让知识型工作者能够自主处理沟通事务和安排会议。随后,许多公司决定解雇秘书和打字员。但结果事与愿违:高技能员工开始花费大量时间发送电子邮件、撰写会议纪要和安排会议,导致他们在实际工作上的生产力大幅下降,企业不得不雇佣更多人手来完成同等工作量。后来对20家财富500强企业的研究发现,那些因计算机技术导致“人员配置失衡”的企业,在薪资上的支出比实际所需多了15%。“电子邮件是那种让人感觉生产力提高、但实际效果相反的技术,”纽波特告诉我,“我担心我们在人工智能领域可能正重蹈覆辙。”  

话又说回来,若另一种结局是股市崩盘引发衰退或金融危机,那么上述情况或许还不算太糟。

作者罗杰·卡玛(Rogé Karma)是《大西洋月刊》的一名撰稿人。


本帖子中包含更多资源

您需要 登录 才可以下载或查看,没有账号?注册

×
 楼主| 发表于 4 天前 | 显示全部楼层

Just How Bad Would an AI Bubble Be?

The entire U.S. economy is being propped up by the promise of productivity gains that seem very far from materializing.


By
Rogé Karma


Illustration by The Atlantic. Sources: Sean Gladwell / Getty; Flavio Coelho / Getty.

September 7, 2025

If there is any field in which the rise of AI is already said to be rendering humans obsolete—in which the dawn of superintelligence is already upon us—it is coding. This makes the results of a recent study genuinely astonishing.

In the study, published in July, the think tank Model Evaluation & Threat Research randomly assigned a group of experienced software developers to perform coding tasks with or without AI tools. It was the most rigorous test to date of how AI would perform in the real world. Because coding is one of the skills that existing models have largely mastered, just about everyone involved expected AI to generate huge productivity gains. In a pre-experiment survey of experts, the mean prediction was that AI would speed developers’ work by nearly 40 percent. Afterward, the study participants estimated that AI had made them 20 percent faster.

But when the METR team looked at the employees’ actual work output, they found that the developers had completed tasks 20 percent slower when using AI than when working without it. The researchers were stunned. “No one expected that outcome,” Nate Rush, one of the authors of the study, told me. “We didn’t even really consider a slowdown as a possibility.”

No individual experiment should be treated as the final word. But the METR study is, according to many AI experts, the best we have—and it helps make sense of an otherwise paradoxical moment for AI. On the one hand, the United States is undergoing an extraordinary, AI-fueled economic boom: The stock market is soaring thanks to the frothy valuations of AI-associated tech giants, and the real economy is being propelled by hundreds of billions of dollars of spending on data centers and other AI infrastructure. Undergirding all of the investment is the belief that AI will make workers dramatically more productive, which will in turn boost corporate profits to unimaginable levels.

On the other hand, evidence is piling up that AI is failing to deliver in the real world. The tech giants pouring the most money into AI are nowhere close to recouping their investments. Research suggests that the companies trying to incorporate AI have seen virtually no impact on their bottom line. And economists looking for evidence of AI-replaced job displacement have mostly come up empty.

None of that means that AI can’t eventually be every bit as transformative as its biggest boosters claim it will be. But eventually could turn out to be a long time. This raises the possibility that we’re currently experiencing an AI bubble, in which investor excitement has gotten too far ahead of the technology’s near-term productivity benefits. If that bubble bursts, it could put the dot-com crash to shame—and the tech giants and their Silicon Valley backers won’t be the only ones who suffer.

Almost everyone agrees that coding is the most impressive use case for current AI technology. Before its most recent study, METR was best known for a March analysis showing that the most advanced systems could handle coding tasks that take a typical human developer nearly an hour to finish. So how could AI have made the developers in its experiment less productive?

The answer has to do with the “capability-reliability gap.” Although AI systems have learned to perform an impressive set of tasks, they struggle to complete those tasks with the consistency and accuracy demanded in real-world settings. The results of the March METR study, for example, were based on a “50 percent success rate,” meaning the AI system could reliably complete the task only half the time—making it essentially useless on its own. This gap makes using AI in a work context challenging. Even the most advanced systems make small mistakes or slightly misunderstand directions, requiring a human to carefully review their work and make changes where needed.

This appears to be what happened during the newer study. Developers ended up spending a lot of time checking and redoing the code that AI systems had produced—often more time than it would have taken to simply write it themselves. One participant later described the process as the “digital equivalent of shoulder-surfing an overconfident junior developer.”

Since the experiment was conducted, AI coding tools have gotten more reliable. And the study focused on expert developers, whereas the biggest productivity gains could come from enhancing—or replacing—the capabilities of less experienced workers. But the METR study might just as easily be overestimating AI-related productivity benefits. Many knowledge-work tasks are harder to automate than coding, which benefits from huge amounts of training data and clear definitions of success. “Programming is something that AI systems tend to do extremely well,” Tim Fist, the director of Emerging Technology Policy at the Institute for Progress, told me. “So if it turns out they aren’t even making developers more productive, that could really change the picture of how AI might impact economic growth in general.”

The capability-reliability gap might explain why generative AI has so far failed to deliver tangible results for businesses that use it. When researchers at MIT recently tracked the results of 300 publicly disclosed AI initiatives, they found that 95 percent of projects failed to deliver any boost to profits. A March report from McKinsey & Company found that 71 percent of  companies reported using generative AI, and more than 80 percent of them reported that the technology had no “tangible impact” on earnings. In light of these trends, Gartner, a tech-consulting firm, recently declared that AI has entered the “trough of disillusionment” phase of technological development.

Perhaps AI advancement is experiencing only a temporary blip. According to Erik Brynjolfsson, an economist at Stanford University, every new technology experiences a “productivity J-curve”: At first, businesses struggle to deploy it, causing productivity to fall. Eventually, however, they learn to integrate it, and productivity soars. The canonical example is electricity, which became available in the 1880s but didn’t begin to generate big productivity gains for firms until Henry Ford reimagined factory production in the 1910s. Some experts believe that this process will play out much faster for AI. “With AI, we’re in the early, negative part of the J-curve,” Brynjolfsson told me. “But by the second half of the 2020s, it’s really going to take off.” Anthropic CEO Dario Amodei has predicted that by 2027, or “not much longer than that,” AI will be “better than humans at almost everything.”

These forecasts assume that AI will continue to improve as fast as it has over the past few years. This is not a given. Newer models have been marred by delays and cancellations, and those released this year have generally shown fewer big improvements than past models despite being far more expensive to develop. In a March survey, the Association for the Advancement of Artificial Intelligence asked 475 AI researchers whether current approaches to AI development could produce a system that matches or surpasses human intelligence; more than three-fourths said that it was “unlikely” or “very unlikely.”

OpenAI’s latest model, GPT-5, was released early last month after nearly three years of work and billions in spending. (The Atlantic entered into a corporate partnership with OpenAI in 2024.) Before the launch, CEO Sam Altman declared that using it would be the equivalent of having “a legitimate Ph.D.-level expert in anything” at your fingertips. In a few areas, including coding, GPT-5 was indeed a major step up. But by most rigorous measures of AI performance, GPT-5 turned out to be, at best, a modest improvement over previous models.

The dominant view within the industry is that it is only a matter of time before companies find the next way to supercharge AI progress. That could turn out to be true, but it is far from guaranteed.

Generative AI would not be the first tech fad to experience a wave of excessive hype. What makes the current situation distinctive is that AI appears to be propping up something like the entire U.S. economy. More than half of the growth of the S&P 500 since 2023 has come from just seven companies: Alphabet, Amazon, Apple, Meta, Microsoft, Nvidia, and Tesla. These firms, collectively known as the Magnificent Seven, are seen as especially well positioned to prosper from the AI revolution.

That prosperity has largely yet to materialize anywhere other than their share prices. (The exception is Nvidia, which provides the crucial inputs—advanced chips—that the rest of the Magnificent Seven are buying.) As The Wall Street Journal reports, Alphabet, Amazon, Meta, and Microsoft have seen their free cash flow decline by 30 percent over the past two years. By one estimate, Meta, Amazon, Microsoft, Google, and Tesla will by the end of this year have collectively spent $560 billion on AI-related capital expenditures since the beginning of 2024 and have brought in just $35 billion in AI-related revenue. OpenAI and Anthropic are bringing in lots of revenue and are growing fast, but they are still nowhere near profitable. Their valuations—roughly $300 billion and $183 billion, respectively, and rising—are many multiples higher than their current revenues. (OpenAI projects about $13 billion in revenues this year; Anthropic, $2 billion to $4 billion.) Investors are betting heavily on the prospect that all of this spending will soon generate record-breaking profits. If that belief collapses, however, investors might start to sell en masse, causing the market to experience a large and painful correction.

During the internet revolution of the 1990s, investors poured their money into basically every company with a “.com” in its name, based on the belief that the internet was about to revolutionize business. By 2000, however, it had become clear that companies were burning through cash with little to show for it, and investors responded by dumping the most overpriced tech stocks. From March 2000 to October 2002, the S&P 500 fell by nearly 50 percent. Eventually, the internet did indeed transform the economy and lead to some of the most profitable companies in human history. But that didn’t prevent a whole lot of investors from losing their shirts.

The dot-com crash was bad, but it did not trigger a crisis. An AI-bubble crash could be different. AI-related investments have already surpassed the level that telecom hit at the peak of the dot-com boom as a share of the economy. In the first half of this year, business spending on AI added more to GDP growth than all consumer spending combined. Many experts believe that a major reason the U.S. economy has been able to weather tariffs and mass deportations without a recession is because all of this AI spending is acting, in the words of one economist, as a “massive private sector stimulus program.” An AI crash could lead broadly to less spending, fewer jobs, and slower growth, potentially dragging the economy into a recession. The economist Noah Smith argues that it could even lead to a financial crisis if the unregulated “private credit” loans funding much of the industry’s expansion all go bust at once.

If we do turn out to be in an AI bubble, the silver lining would be that fears of sudden AI-driven job displacement are overblown. In a recent analysis, the economists Sarah Eckhardt and Nathan Goldschlag used five different measurements of AI exposure to estimate how the new technology might be affecting a range of labor-market indicators and found virtually no effect on any of them. For example, they note that the unemployment rate for the workers least exposed to AI, such as construction workers and fitness trainers, has risen three times faster than the rate for the workers most exposed to it, such as telemarketers and software developers. Most other studies, though not all, have come to similar conclusions.

But there’s also a weirder, in-between possibility. Even if AI tools don’t increase productivity, the hype surrounding them could push businesses to keep expanding their use anyway. “I hear the same story over and over again from companies,” Daron Acemoglu, an economist at MIT, told me. “Mid-to-high-level managers are being told by their bosses that they need to use AI for X percent of their job to satisfy the board.” These companies might even lay off workers or slow their hiring because they are convinced—like the software developers from the METR study—that AI has made them more productive, even when it hasn’t. The result would be an increase in unemployment that isn’t offset by actual gains in productivity.

As unlikely as this scenario sounds, a version of it happened in the not-so-distant past. In his 2021 book, A World Without Email, the computer scientist Cal Newport points out that beginning in the 1980s, tools such as computers, email, and online calendars allowed knowledge workers to handle their own communications and schedule their own meetings. In turn, many companies decided to lay off their secretaries and typists. In a perverse result, higher-skilled employees started spending so much of their time sending emails, writing up meeting notes, and scheduling meetings that they became far less productive at their actual job, forcing the companies to hire more of them to do the same amount of work. A later study of 20 Fortune 500 companies found that those with computer-driven “staffing imbalances” were spending 15 percent more on salary than they needed to. “Email was one of those technologies that made us feel more productive but actually did the opposite,” Newport told me. “I worry we may be headed down the same path with AI.”

Then again, if the alternative is a stock-market crash that precipitates a recession or a financial crisis, that scenario might not be so bad.
您需要登录后才可以回帖 登录 | 注册

本版积分规则

手机版|小黑屋|BC Morning Website ( Best Deal Inc. 001 )  

GMT-8, 2025-10-24 03:17 , Processed in 0.021568 second(s), 19 queries .

Supported by Best Deal Online X3.5

© 2001-2025 Discuz! Team.

快速回复 返回顶部 返回列表