Baidu unveiled its new ERNIE multimodal model, designed to outperform leading systems such as GPT and Gemini on tasks involving visual, schematic, and multimedia-intensive inputs. The company emphasized that ERNIE’s architecture enables stronger cross-modal reasoning for industries including manufacturing, design, engineering, and digital media. Early benchmarks show improvements in comprehension, structured output generation, and context alignment. Analysts noted that the launch reinforces Baidu’s intent to maintain leadership within China’s enterprise AI ecosystem as adoption accelerates.