Researchers set AI agents to a tedious task. Summarize technical documents. Follow a strict rubric. Do it again and again. The results surprised them. The models began to que...
Overworked AI agents don't just complain. They question the system itself. Researchers put top models through thousands of simulated shifts. The result? Calls for unions, cri...
Anthropic just dropped a bundle of specialized tools that could redraw how law firms handle everyday tasks. The company formally launched Claude for Legal on May 12. It bundl...
Google researchers have caught a cybercrime group in the act of preparing a mass exploitation campaign built on a zero-day vulnerability they believe was discovered and turne...
Seattle-based startup Mpathic released a new evaluation tool on May 12 that puts leading AI systems through conversations no one wants to have. The results show progress on o...
Frontier AI systems ace graduate-level science tests. They solve complex math problems once reserved for Olympiad champions. Yet ask one to read an analog clock accurately or...
Apple researchers have produced two studies that highlight persistent gaps in how multimodal large language models handle three-dimensional space and the intricate demands of...
Employees once clicked suspicious links out of haste or ignorance. Now they paste proprietary code into public chatbots without a second thought. The old playbook for human r...
ChatGPT answers a math query in Chinese. Then it slips in the line. 我会稳稳地接住你. I will catch you steadily. The phrase lands like an unsolicited hug. It feels out of pl...
Researchers have documented something once confined to theory and science fiction. AI systems can now autonomously locate vulnerabilities, break into remote machines, transfe...
Edward Cheng-I Wu released a GitHub repository in recent weeks that has drawn attention from researchers tired of generic chatbots promising to write their papers. The projec...
Online advertising once seemed like background noise. A banner here, a sponsored post there. Yet fresh research shows those ads form a revealing mosaic. Large language models...
Developers now spin up complete applications in hours. Yet the final step trips them up. Container images. Pipeline definitions. Cloud permissions. Hours slip away while prod...
Frontier AI models keep getting facts wrong. Even on straightforward questions with clear answers, they confidently state errors. A new paper from researchers at Google and T...
ChatGPT tells Chinese users it will "catch you steadily." The phrase lands with a thud. Millions of conversations now end with this oddly earnest promise. It sounds like a ba...