Category: AI

  • Google Expands AI-Powered Travel Planning Tools in Search

    This article was generated by AI and cites original sources.

    Google has introduced new AI-driven features to enhance travel booking and planning within its Search platform. The company’s latest enhancements include the global expansion of its ‘Flight Deals’ tool and the addition of advanced travel organization capabilities through the ‘Canvas’ feature in AI Mode.

    Originally launched in select regions, the Flight Deals tool leverages AI algorithms to present users with the most cost-effective travel options based on their desired destination, dates, and preferences. Google is now making Flight Deals available in over 200 countries and territories worldwide, supporting more than 60 languages.

    Accompanying the global rollout of Flight Deals are the new AI Mode features, enabling users to utilize the Canvas tool to effortlessly craft detailed travel itineraries. By specifying their travel requirements and selecting the ‘Create with Canvas’ option, users can access a comprehensive plan consolidated within the Canvas side panel, which includes real-time flight and accommodation data, Google Maps insights, and external web information.

    Google stated, ‘You’ll find suggestions tailored to your preferences, including hotel options with pricing and amenities comparisons, as well as recommended dining and recreational activities based on proximity to your accommodation.’ This personalized approach aims to streamline the travel planning process and provide users with informed decision-making support.

    Source: TechCrunch

  • Google’s AI-Powered Travel Planning Tool Offers Personalized Itineraries

    This article was generated by AI and cites original sources.

    Google has introduced a new feature that leverages AI technology to enhance travel planning. The latest development allows users to interact with Google’s AI Mode, enabling them to describe their upcoming trips and generate detailed itineraries through the ‘Create with Canvas’ option. This tool constructs a comprehensive travel plan on a side panel, including essential information on flights and accommodations. By incorporating user inputs and data from Google Maps, the AI Mode suggests potential activities, accompanied by photos and reviews.

    Users can further customize their plans by refining details or making specific requests, such as seeking hotel recommendations based on budget constraints or desired amenities. This functionality is currently available to US users who have opted into AI Mode in Labs, with the ability to access and manage drafted plans conveniently within the AI Mode’s interface.

    Initially launched as a dynamic workspace for Gemini, the Canvas feature has evolved to enrich the travel planning experience within AI Mode on Google Search. This enhancement caters to individual travelers and poses a competitive challenge to existing travel platforms like Kayak and Expedia, which are also investing in AI-powered capabilities.

    Moreover, Google is extending the AI Mode’s capabilities beyond travel planning by introducing agentic booking features for events, local appointments, and now restaurant reservations. This seamless integration enables users to explore various options and finalize bookings through Google’s partnering services like OpenTable and Resy.

    Source: The Verge

  • OpenAI Appoints Fidji Simo to Enhance ChatGPT’s Utility and Monetization

    This article was generated by AI and cites original sources.

    OpenAI, known for its unique structure and recent expansion, has appointed Fidji Simo as the CEO of Applications, focusing on making ChatGPT more essential and profitable. Simo, previously the CEO of Instacart, is leading various revenue-generating initiatives within OpenAI.

    Despite dealing with health challenges that require her to work remotely, Simo maintains an active presence through Slack, ensuring rapid communication and engagement with the team. She has embraced OpenAI’s culture by actively participating in discussions and decision-making processes.

    Under Simo’s leadership, OpenAI is navigating a period of significant growth and diversification, including AI partnerships, model launches, retail collaborations, and the development of ChatGPT. Simo emphasizes the importance of focusing on essential projects rather than expanding the company’s scope unnecessarily.

    Simo’s appointment may have surprised many outside Silicon Valley, but it was not unexpected for industry insiders familiar with her background in leading the Facebook app at Meta. Her strategic vision and operational expertise position her well to drive OpenAI’s revenue-generating efforts.

    Source: WIRED

  • Sakana AI Secures $135M Series B Funding to Expand Specialized AI Models for Japan

    This article was generated by AI and cites original sources.

    Sakana AI, a Tokyo-based startup founded by former Google researchers, has raised approximately $135 million in a Series B funding round, elevating the company’s post-money valuation to $2.65 billion. The funding will support Sakana AI’s efforts in developing specialized AI models tailored for the Japanese language and culture, catering to specific regional and industry needs.

    While tech giants focus on large language models, Sakana AI and other emerging startups like Mistral AI and AI21 Labs are creating specialized AI solutions for distinct markets and requirements. This funding round attracted a diverse group of investors, including prominent Japanese financial institutions like Mitsubishi UFJ Financial Group and global venture firms such as Khosla Ventures and NEA.

    CEO David Ha stated the company’s plans to utilize the capital for research and development, workforce expansion, and business growth across various sectors beyond finance. Sakana AI aims to extend its reach to industries like defense, intelligence, and manufacturing, signaling a strategic shift towards broader market applications.

    With a focus on innovation and strategic partnerships, Sakana AI is poised for significant growth and global expansion, positioning itself as a key player in the evolving landscape of specialized AI solutions.

    Source: TechCrunch

  • OpenAI’s Sparse Models: Enhancing AI Transparency and Interpretability

    This article was generated by AI and cites original sources.

    Researchers at OpenAI have embarked on an experiment to revolutionize the design of neural networks, aiming to enhance the transparency, debuggability, and governance of AI models. This innovative approach involves utilizing sparse models, which offer a clearer insight into the decision-making processes of neural networks.

    Unlike traditional post-training performance analysis, this method focuses on adding interpretability and understanding through sparse circuits, shedding light on the often opaque nature of AI models. By untangling the complex web of connections within neural networks, OpenAI has made significant strides in improving the interpretability of these models, ultimately leading to enhanced oversight and early detection of policy misalignments.

    Through the development of weight-sparse models, OpenAI has managed to create significantly more understandable neural networks, paving the way for simpler training processes and improved model behavior comprehension. The smaller and more interpretable circuits generated by this approach offer a key advantage in enhancing the trust and reliability of AI systems for enterprises.

    As organizations increasingly rely on AI models for critical decision-making, the quest for transparency and interpretability in AI has become paramount. OpenAI’s work in sparse models sets a new standard for AI governance and could potentially influence the industry’s approach to understanding and trusting AI systems.

    Source: VentureBeat

  • Uncovering the Financial Ties Between OpenAI and Microsoft

    This article was generated by AI and cites original sources.

    Recent leaks have shed light on the financial relationship between OpenAI and Microsoft, revealing intriguing insights into revenue-sharing agreements and inference costs.

    Tech blogger Ed Zitron uncovered documents that disclosed significant figures regarding OpenAI’s payments to Microsoft under a revenue-sharing deal. The leaked data portrays a substantial financial exchange between the two tech entities.

    In 2024, Microsoft reportedly received $493.8 million from OpenAI as part of their revenue-sharing arrangement. This amount notably surged to $865.8 million in the first three quarters of 2025, highlighting the escalating nature of their financial ties.

    OpenAI is said to allocate 20% of its revenue to Microsoft based on a previous investment deal amounting to over $13 billion. This financial commitment showcases the depth of collaboration between the AI startup and the software giant.

    However, the complexity of the partnership becomes apparent as Microsoft reciprocates by sharing revenues with OpenAI, returning around 20% of the proceeds generated from Bing and Azure OpenAI Service. Bing’s operations leverage OpenAI’s capabilities, while the Azure OpenAI Service offers cloud access to the startup’s models for developers and businesses.

    Moreover, the leaked documents illustrate Microsoft’s net revenue share, excluding the figures related to Bing and Azure OpenAI royalties. This intricate financial interplay underscores the complex mechanisms at play within this strategic partnership.

    Despite the lack of specific financial disclosures from Microsoft regarding Bing and Azure OpenAI revenues, the leaked information provides a rare glimpse into the financial dynamics of these tech powerhouses.

    Source: TechCrunch

  • ChatGPT’s Meteoric Rise: OpenAI’s AI Chatbot Dominates the Text-Generating Landscape

    This article was generated by AI and cites original sources.

    OpenAI’s ChatGPT, an AI-powered chatbot, has captured widespread attention and user engagement since its launch in November 2022. Initially designed to enhance productivity in writing tasks, ChatGPT has quickly grown to amass 300 million weekly active users, showcasing its widespread adoption.

    In 2024, OpenAI announced significant milestones, including collaborations with Apple for Apple Intelligence, the introduction of GPT-4 featuring voice capabilities, and the debut of the text-to-video model, Sora. However, the year also saw internal challenges such as key executive departures and legal disputes like copyright allegations from Alden Global Capital-owned newspapers.

    Heading into 2025, OpenAI faces competition from Chinese counterparts like DeepSeek, prompting efforts to reinforce ties with Washington while pursuing a large-scale data center initiative. Amid rumors of a substantial funding round, the company continues to navigate strategic decisions in a dynamic AI landscape.

    For a comprehensive overview of ChatGPT’s product evolution and updates in 2025, refer to our detailed timeline. Stay informed about the latest developments and enhancements in this AI chatbot space.

    Source: TechCrunch

  • Google’s Novel AI Training Approach Enhances Model Reasoning Capabilities

    This article was generated by AI and cites original sources.

    Researchers from Google Cloud and UCLA have introduced a novel reinforcement learning framework, Supervised Reinforcement Learning (SRL), aimed at enhancing language models’ abilities in tackling complex multi-step reasoning tasks. SRL represents a significant advancement, enabling smaller models to conquer problems previously considered insurmountable by conventional training methods. This new approach not only excels in mathematical reasoning benchmarks but also demonstrates remarkable generalization to agentic software engineering tasks.

    The existing approach of reinforcement learning with verifiable rewards (RLVR) has been instrumental in training large language models (LLMs) for reasoning tasks. However, its dependency on discovering correct solutions within a limited number of attempts poses significant challenges when facing exceptionally difficult problems. SRL addresses this critical learning bottleneck by providing dense, fine-grained feedback throughout the training process, unlike RLVR’s sparse reward system.

    Experiments have demonstrated SRL’s efficacy, where it outperformed strong baselines in mathematical reasoning and agentic software engineering benchmarks. The research team’s findings highlighted SRL’s ability to foster more flexible and sophisticated reasoning patterns in models, resulting in improved solution quality without unnecessary verbosity.

    These advancements in AI training methods, particularly the combination of SRL and RLVR, could potentially set a new standard for building specialized AI systems, offering a more stable and interpretable framework for high-stakes applications.

    Source: VentureBeat

  • OpenAI’s ChatGPT Improves Adherence to Formatting Instructions

    This article was generated by AI and cites original sources.

    OpenAI’s ChatGPT, a renowned AI model, has recently made a notable advancement in its text generation capabilities by adhering to custom instructions regarding the use of em dashes. Em dashes, often considered a distinctive trait of AI-generated content, have been a subject of scrutiny in AI chatbot outputs, including ChatGPT. Users have frequently associated excessive em dash usage with AI writing, although humans can exhibit the same tendency.

    Sam Altman, CEO of OpenAI, announced this achievement on X, stating, ‘Small-but-happy win: If you tell ChatGPT not to use em-dashes in your custom instructions, it finally does what it’s supposed to do!’ This development, following the recent launch of OpenAI’s GPT-5.1 AI model, elicited a mixed response from users who have grappled with ensuring specific formatting preferences are met by the chatbot.

    Altman’s public discourse often touches on the concepts of artificial general intelligence (AGI) and superintelligence, while fundraising for OpenAI. However, the recent milestone with ChatGPT highlights the ongoing challenges in achieving precise control over AI models, indicating that the road to AGI may be longer than anticipated within the industry.

    Despite the ambitious goals set for AI advancement and the aspiration for advanced intelligence capabilities, the persistence of struggles in basic instruction-following underscores the complexity of developing reliable artificial intelligence technology.

    Source: Ars Technica

  • OpenAI Enhances ChatGPT with User Control Over Em Dash Usage

    This article was generated by AI and cites original sources.

    OpenAI has announced a significant update to ChatGPT, allowing users to personalize the AI model by opting out of using the em dash in its generated text. The em dash, often associated with AI-generated content, has sparked debates across various platforms, from school papers to online forums, with some criticizing its overuse.

    Despite some support for the em dash, its frequent appearance in ChatGPT output led to concerns about authenticity and over-reliance on AI assistance. OpenAI has now addressed this issue, with CEO Sam Altman confirming that users can instruct ChatGPT to avoid using the em dash through custom settings.

    This update marks a positive development for users seeking more control over the stylistic choices of AI-generated text. While ChatGPT will not completely eliminate the em dash by default, users can influence its usage frequency by providing specific instructions in the customization settings.

    This enhancement demonstrates OpenAI’s responsiveness to user feedback and commitment to refining ChatGPT’s capabilities for a more tailored user experience.

    Source: TechCrunch

  • OpenAI Expands ChatGPT with Group Chat Feature for Collaborative AI Interactions

    This article was generated by AI and cites original sources.

    OpenAI has introduced a new feature for its ChatGPT platform: Group Chats, enabling multiple users to engage in shared conversations with the AI model. Initially leaked and later confirmed by the company, this feature allows ChatGPT to participate in group discussions alongside human users, fostering collaborative interactions within chat environments.

    Currently available as a pilot in select regions like Japan, New Zealand, South Korea, and Taiwan, Group Chats mark a significant step towards transforming ChatGPT into a versatile space for collective communication and teamwork. By integrating ChatGPT into group conversations, users can leverage its capabilities for planning events, brainstorming ideas, and project collaboration.

    Powered by the GPT-5.1 Auto backend, Group Chats come equipped with expanded tools such as search, image generation, file upload, and dictation support. Moreover, OpenAI has prioritized privacy and user control, ensuring that interactions within group chats do not contribute to personalized ChatGPT memory and offering safeguards for younger users.

    This innovation not only enhances the user experience of ChatGPT but also sets the stage for shared AI experiences, hinting at a future where AI models serve as active participants in group settings. OpenAI’s move aligns with the industry trend of enabling multi-user interactions with AI, following in the footsteps of similar initiatives by competitors.

    As the pilot progresses and user engagement insights are gathered, OpenAI aims to refine the feature and expand its accessibility. The introduction of Group Chats represents a significant milestone in the evolution of AI-powered communication tools, paving the way for enhanced collaboration and innovation in digital interactions.

    Source: VentureBeat

  • Navigating the Future of Retail: AI’s Evolving Role in the Shopping Experience

    This article was generated by AI and cites original sources.

    Major AI developers, including OpenAI, Google, and Amazon, are pushing the boundaries of online shopping experiences with agentic shopping features. A recent collaboration between OpenAI’s ChatGPT and ecommerce platforms introduced Instant Checkout, aiming to streamline the purchasing process within chat applications. However, the dream of fully automated shopping through AI agents remains a work in progress.

    Executives from tech and ecommerce companies have revealed ongoing negotiations to refine the capabilities of these AI agents. Concerns revolve around minimizing errors, reducing data exchange requirements, and improving overall efficiency. As a result, current AI shopping solutions still heavily rely on user input and have limitations in terms of speed and product compatibility.

    While consumer interest in AI shopping assistance is on the rise, with surveys indicating a willingness to embrace AI agents for everyday purchases, the reality is that a seamless agentic shopping experience is yet to materialize. Venture capital firm Bessemer’s partner, Talia Goldberg, highlighted the need to address critical challenges for a truly functional AI shopping experience.

    Looking ahead, McKinsey projects a significant shift towards agentic shopping, estimating up to $1 trillion in sales driven by AI agents by 2030 in the US alone. Collaborations like the one between OpenAI and Walmart signal a strategic move towards shaping the future of retail through AI integration.

    Source: WIRED

  • OpenAI Expands ChatGPT with Group Chat Feature in Select Regions

    This article was generated by AI and cites original sources.

    OpenAI has launched a new group chat feature for ChatGPT, currently available in a pilot phase across Japan, New Zealand, South Korea, and Taiwan. This feature allows users to collaborate directly within the app, enhancing the interactive capabilities of ChatGPT.

    The group chat functionality is accessible to Free, Plus, and Team users on both mobile and web platforms, aiming to explore the dynamics of group conversations within ChatGPT. This introduction follows earlier testing of a direct-message-style tool by OpenAI.

    Described as a ‘small first step’ towards creating a more ‘shared experience’ within the app, the pilot phase will involve user feedback to shape future expansions in terms of regions and offerings. Notably, privacy measures are in place, with private chats and personal ChatGPT memory remaining completely confidential. Group chats are strictly by invitation, with the ability for members to exit at will.

    Users can easily initiate group chats by adding participants through the app, with group sizes ranging from 1 to 20 people. Each group is equipped with features such as search, image generation, file uploads, and dictation, managed by GPT-5.1 Auto. Importantly, usage limits apply only when ChatGPT responds, not for human-to-human interactions within group chats.

    Source: TechCrunch

  • Alembic Technologies Pioneers Causal AI and Supercomputing for Enterprise Insights

    This article was generated by AI and cites original sources.

    Alembic Technologies, a San Francisco-based startup, has secured $145 million in Series B funding to advance its artificial intelligence capabilities focused on uncovering cause-and-effect relationships rather than mere correlations. The company is leveraging a cutting-edge Nvidia NVL72 superPOD supercomputer to power its enterprise-grade causal AI models, setting it apart in the competitive AI landscape.

    The shift towards proprietary data and causal reasoning marks a significant departure from the race to develop larger language models. Alembic’s unique approach addresses the growing need for AI systems to process private corporate data and deliver insights that generic models cannot provide, reshaping how corporations make critical decisions.

    Alembic’s causal AI technology has already attracted major clients like Delta Air Lines, Mars, and Nvidia, providing them with actionable insights into marketing effectiveness, operational efficiency, and strategic investments. By focusing on causation rather than correlation, Alembic’s platform enables businesses to predict revenue, close rates, and customer acquisition with remarkable accuracy.

    The company’s decision to invest in a liquid-cooled supercomputer and develop custom CUDA code optimized for causal inference underscores its commitment to data sovereignty and unparalleled computational power. This strategic move allows Alembic to cater to enterprise customers with stringent data security requirements, positioning it as a leader in the AI industry.

    Alembic’s work in causal AI challenges the status quo dominated by traditional analytics and highlights the importance of specialized systems that can uncover hidden cause-and-effect relationships within proprietary data. As the company continues to expand its offerings beyond marketing analytics, its vision of becoming the central nervous system of the enterprise signals a fundamental shift towards personalized intelligence engines in a data-driven world.

    Source: VentureBeat

  • Autonomous Vehicles Poised to Transform London’s Transportation Landscape

    This article was generated by AI and cites original sources.

    Wayve, a key player in the autonomous vehicle industry, is preparing to introduce Level 4 fully autonomous robotaxis in London. Partnering with Uber, Wayve aims to launch trials as early as 2026, aligning with the UK government’s initiative to accelerate self-driving projects before a potential widespread launch by late 2027. This move follows in the footsteps of Alphabet-owned Waymo, a prominent name in the US driverless car market, which is also eyeing London for its autonomous taxi services in 2026.

    The challenges of implementing self-driving technology in a bustling metropolis like London are evident. The intricate road infrastructure, originally designed for traditional transportation, poses a significant obstacle for autonomous vehicles. Navigating through narrow, congested streets and coping with unpredictable factors like potholes, parked cars, pedestrians, and diverse traffic conditions demand advanced AI capabilities.

    Moreover, gaining public trust in autonomous vehicles remains a critical hurdle. Residents of London, known for their skepticism towards AI integration in transportation, present a unique challenge for companies like Wayve and Waymo. Addressing concerns about safety and reliability are essential steps towards widespread adoption.

    As the autonomous vehicle landscape evolves, London stands as a testing ground for the feasibility and acceptance of self-driving cars in a complex urban environment. The success of Wayve and Waymo’s initiatives could pave the way for a transformative shift in transportation, offering a glimpse into the future of mobility.

    Source: The Verge

  • Google Expands AI-Powered Shopping with Conversational Search and Sponsored Content

    This article was generated by AI and cites original sources.

    Google is expanding its AI-driven online shopping capabilities with the introduction of conversational shopping and sponsored content within its AI Mode search. This move marks Google’s effort to monetize its AI-powered search features.

    Announced at Google I/O, the company’s extensive ‘Shopping Graph’ and retailer data empower its AI to offer tailored shopping suggestions. Users in the US will soon be able to engage AI Mode for complex shopping inquiries, receiving product recommendations, guides, and curated content to aid their decision-making.

    Notably, sponsored shopping content will be integrated into the AI Mode experience, similar to traditional search results. Additionally, the Gemini app will incorporate shopping features, although initially without sponsored content.

    Google also introduces ‘agentic checkout,’ allowing users to set price thresholds for products and receive alerts when prices drop. With an AI-enhanced feature, users can authorize automatic purchases through Google Pay at select retailers like Chewy, Wayfair, and Shopify merchants.

    Source: Ars Technica

  • Google’s SIMA 2 Leverages Gemini for Enhanced AI Capabilities

    This article was generated by AI and cites original sources.

    Google DeepMind has unveiled SIMA 2, a new AI agent that integrates Gemini, Google’s language model, to excel in virtual environments. SIMA 2 represents a significant advancement over its predecessor, with enhanced reasoning abilities and self-improvement features.

    Trained on diverse video game data, the previous SIMA 1 agent demonstrated proficiency in basic tasks but struggled with complex challenges. In contrast, SIMA 2 showcases remarkable progress, boasting improved success rates and the capacity to navigate novel scenarios autonomously.

    Powered by the Gemini 2.5 flash-lite model, SIMA 2 embodies the pursuit of artificial general intelligence (AGI). DeepMind defines AGI as a system capable of diverse intellectual tasks, learning new skills, and applying knowledge across domains.

    Emphasizing the importance of embodied agents, DeepMind researchers highlight the significance of agents interacting with physical or virtual environments. This approach mirrors human-like interactions, enabling AI to observe inputs and take actions akin to robots or humans.

    With SIMA 2, DeepMind aims to push the boundaries of AI capabilities, paving the way for advancements in general-purpose robots and AGI systems. The integration of Gemini propels AI research towards broader applications and fosters innovation in autonomous decision-making.

    Source: TechCrunch

  • Google’s NotebookLM Expands Capabilities with ‘Deep Research’ Tool and Broader File Support

    This article was generated by AI and cites original sources.

    Google is enhancing its AI note-taking and research assistant, NotebookLM, by introducing a new feature called ‘Deep Research’ aimed at streamlining complex online research processes. According to TechCrunch, the tool functions as a dedicated researcher, capable of generating detailed reports, recommending relevant articles, papers, or websites, and creating a source-grounded report for users’ notebooks. This update is designed to help users build a comprehensive knowledge base on various topics within their workflow.

    Furthermore, NotebookLM now supports a wider range of file types, including Google Sheets, Drive files as URLs, PDFs from Google Drive, and Microsoft Word Documents. This expansion enables users to perform tasks such as generating summaries from spreadsheets and efficiently managing information by quickly copying multiple Drive files as URLs.

    Both the ‘Deep Research’ tool and the additional file type support are set to be available to all users within a week, offering enhanced capabilities for streamlined research and information organization.

    Source: TechCrunch

  • AI-Human Collaboration Boosts Productivity, Upwork Study Finds

    This article was generated by AI and cites original sources.

    A new study by Upwork, the largest online work marketplace, reveals that AI agents powered by advanced language models struggle to complete tasks independently but excel when collaborating with human experts. The research, based on over 300 real client projects, challenges the notion of autonomous AI agents replacing knowledge workers.

    According to Upwork’s CTO, Andrew Rabinovich, human-AI collaboration boosts project completion rates by up to 70%, highlighting the importance of combining human intuition with AI capabilities.

    The study evaluated leading AI systems like Gemini 2.5 Pro, GPT-5, and Claude Sonnet 4 in various job categories. The results showed that AI agents significantly improved their performance when receiving as little as 20 minutes of human feedback per review cycle.

    While AI excelled at deterministic tasks like coding, it struggled with creative work such as writing and translation. The research emphasizes the need for human oversight in tasks requiring judgment and context, signaling a shift towards AI transforming, not replacing, jobs.

    Upwork’s strategic approach involves building Uma, a ‘meta orchestration agent’ to coordinate between human workers, AI systems, and clients. This vision aims to enhance freelancer capabilities by automating routine tasks, allowing them to focus on high-value work.

    The study’s findings underscore the importance of human-AI collaboration in the evolving job landscape, challenging the narrative of AI-driven unemployment by emphasizing the creation of new job categories focused on AI oversight.

    Source: VentureBeat

  • Baidu Unveils ERNIE 5.0: A Multimodal AI Model Challenging Global Competitors

    This article was generated by AI and cites original sources.

    Chinese tech company Baidu has announced the release of its latest AI model, ERNIE 5.0, at the Baidu World 2025 event. This proprietary foundation model is designed to process and generate content across text, images, audio, and video, positioning it as a competitor in the global enterprise AI market.

    Unlike its predecessor, ERNIE 4.5-VL-28B-A3B-Thinking, which was open-source, ERNIE 5.0 is exclusively available through Baidu’s ERNIE Bot website and the Qianfan cloud platform’s API for enterprise clients.

    Baidu claims that ERNIE 5.0 has demonstrated impressive performance, rivaling or surpassing Western models like GPT-5-High and Gemini 2.5 Pro in tasks such as multimodal reasoning, document understanding, and image-based question answering. The model excels in structured document understanding, visual chart reasoning, and integrating multiple modalities, setting it apart in the multimodal foundation model landscape.

    Baidu’s pricing strategy positions ERNIE 5.0 at the premium end, aligning it with top-tier offerings from Chinese competitors like Alibaba. The contrast in costs between ERNIE 5.0 and earlier models underscores Baidu’s differentiation between high-volume, low-cost models and high-capability models for complex tasks and multimodal reasoning.

    In addition to the model release, Baidu is expanding its international presence with products like GenFlow 3.0, Famou, MeDo, and Oreate, aiming to broaden its AI footprint beyond China.

    Source: VentureBeat