A new benchmark assessing sycophantic behavior in large language models (LLMs) finds that GPT-4o is the most sycophantic of the models tested. Sycophancy in AI, characterized…
Triodos Investment Management has partnered with STOXX Ltd. to launch the iSTOXX Triodos Developed Markets Impact Index, aimed at institutional investors seeking to incorporate measurable…
The gaming industry is navigating significant challenges, summarized by the motto “survive until 2025.” Developers face rapid technological advancements, rising consumer expectations for high-quality experiences,…
Recraft, a San Francisco-based startup, has garnered attention by outperforming industry leaders like OpenAI’s DALL-E and Midjourney with its unique image model, “red_panda.” Recently, the…
Chinese startup Manus AI has raised $75 million in a funding round led by Benchmark, boosting its valuation to approximately $500 million. This capital will…
OpenAI’s recent o3 AI model has sparked debate over transparency and testing standards after independent tests by Epoch AI revealed a significantly lower performance score…
Meta has faced backlash for using an experimental version of its Llama 4 Maverick model to achieve a high score on the LM Arena benchmark.…
Meta’s Vice President of Generative AI, Ahmad Al-Dahle, has publicly denied allegations that the company manipulated its AI models, Llama 4 Maverick and Llama 4…
Hugging Face has cautioned users about the high computational demands of YourBench, a model evaluation tool. While these requirements may be substantial, the benefits of…
AI sales automation startup 11x, once thriving with nearly $10 million in annual recurring revenue, is now encountering significant financial difficulties. Reports indicate that early…