Categories: Wire Stories

Global and China Automotive AI Foundation Model Technology and Application Trends Report 2023-2024: AI and Cloud Companies Attract Attention as Foundation Models Emerge – ResearchAndMarkets.com

DUBLIN–(BUSINESS WIRE)–The “Global and China Automotive AI Foundation Model Technology and Application Trends Report, 2023-2024” report has been added to ResearchAndMarkets.com’s offering.

Since 2023 ever more vehicle models have begun to be connected with foundation models, and an increasing number of Tier1s have launched automotive foundation model solutions. Especially Tesla’s big progress of FSD V12 and the launch of SORA have accelerated implementation of AI foundation models in cockpits and intelligent driving.

End-to-End autonomous driving foundation models boom.

In February 2023, Tesla FSD v12.2.1, which adopts an end-to-end autonomous driving model, began to be pushed in the United States, not just to employees and testers. According to the feedback from the first customers, FSD V12 is quite powerful, allowing ordinary people who previously did not believe in and use autonomous driving to dare to use FSD. For example, Tesla FSD V12 can bypass puddles on roads. A Tesla engineer commented: this kind of driving approach is difficult to implement with explicit code, but Tesla’s end-to-end approach makes it almost effortlessly.

The development of AI foundation models for autonomous driving can be divided into four phases.

Phase 1.0 uses a foundation model (Transformer) at the perception level.
Phase 2.0 is modularization, with foundation models used in perception, planning & control and decision.
Phase 3.0 is end-to-end foundation models (one ‘end’ is raw data from sensors, and the other ‘end’ directly outputs driving actions).
Phase 4.0 is about heading from vertical AI to artificial general intelligence (AGI’s world model).

AI foundation models evolve rapidly, bringing new opportunities.

In recent three years foundation models for autonomous driving have undergone several evolutions, and the autonomous driving systems of leading automakers must be rewritten almost every year, which also provides entry opportunities for late entrants.

At CVPR 2023, UniAD, an end-to-end autonomous driving algorithm jointly released by SenseTime, OpenDriveLab and Horizon Robotics, won the 2023 Best Paper.

In early 2024, Waytous’ technical team and the Institute of Automation Chinese Academy of Sciences jointly proposed GenAD, the industry’s first generative end-to-end autonomous driving model which combines generative AI and end-to-end autonomous driving technology. This technology is a disruption to UniAD progressive process end-to-end solution, and explores a new end-to-end autonomous driving mode. The key is to using generative AI to predict temporal evolution of the vehicle and surroundings in past scenarios.

In February 2024, Horizon Robotics and Huazhong University of Science and Technology proposed VADv2, an end-to-end driving model based on probabilistic planning. VADv2 takes multi-view image sequences as input in a streaming manner, transforms sensor data into environmental token embeddings, outputs the probabilistic distribution of action, and samples one action to control the vehicle. Using only camera sensors, VADv2 achieves state-of-the-art closed-loop performance in CARLA Town05 benchmark test, much better than all existing approaches. It runs stably in a fully end-to-end manner, even without rule-based wrapper.

On the Town05 Long benchmark, VADv2 achieved a Drive Score of 85.1, a Route Completion of 98.4, and an Infraction Score of 0.87, as shown in Tab. 1. Compared to the previous state-of-the-art method, VADv2 achieves a higher Route Completion while significantly improving Drive Score by 9.0. It is worth noting that VADv2 only utilizes cameras as perception input, while DriveMLM utilizes both cameras and LiDAR. Furthermore, compared to the previous best method which only relies on cameras, VADv2 demonstrates even greater advantages, with a remarkable increase in Drive Score of up to 16.8.

Also in February 2024, the Institute for Interdisciplinary Information Sciences at Tsinghua University and Li Auto introduced DriveVLM (its whole process shown in the figure below). A range of images are processed by a large visual language model (VLM) to perform specific chain of thought (CoT) reasoning to produce driving planning results. This large VLM includes a visual encoder and a large language model (LLM).

Due to limitations of VLMs in spatial reasoning and high computing requirements, DriveVLM team proposed DriveVLM-Dual, a hybrid system that combines advantages of DriveVLM and conventional autonomous driving pipelines. DriveVLM-Dual optionally combines DriveVLM with conventional 3D perception and planning modules, such as 3D object detector, occupancy network, and motion planner, allowing the system to achieve 3D localization and high-frequency planning. This dual-system design, similar to slow and fast thinking processes of human brain, can effectively adapt to changing complexity of driving scenarios.

AI and cloud companies attract attention as foundation models emerge.

As AI foundation models emerge, computing power, algorithm and data are indispensable. AI companies (iFLYTEK, SenseTime, Megvii, etc.) that are good at algorithms and have a large reserve of computing power, and cloud computing companies (Inspur, Volcengine, Tencent Cloud, etc.) with powerful intelligent computing centers, come under a spotlight of OEMs.

In the field of AI Foundation Model, SenseTime has deployed cockpit multimodal foundation model SenseChat-Vision, Artificial Intelligence Data Center (AIDC, with computing power of 6000P), and autonomous driving foundation model DriveMLM. In early 2024, SenseTime launched DriveMLM and achieved good results on CARLA, the most authoritative list of closed-loop test. DriveMLM is an intermediate solution between modular and end-to-end solutions and is interpretable.

For collection of autonomous driving corner cases, Volcengine and Haomo.ai work together to use foundation models to generate scenarios and improve annotation efficiency. The cloud service capabilities provided by Volcengine help Haomo.ai to improve the overall pre-annotation efficiency of DriveGPT by 10 times.

In 2023, Tencent released upgraded products and solutions in Intelligent Vehicle Cloud, Intelligent Driving Cloud Map, Intelligent Cockpit and other fields. In terms of computing power, Tencent Intelligent Vehicle Cloud enables 3.2Tbps bandwidth, 3 times higher computing performance, 10 times higher communication performance, and an over 60% increase in computing cluster GPU utilization, providing high-bandwidth, low-latency intelligent computing power support for training foundation models for intelligent driving.

As for training acceleration, Tencent Intelligent Vehicle Cloud combines Angel Training Acceleration Framework, with training speed twice and reasoning speed 1.3 times faster than the industry’s mainstream frameworks. Currently Bosch, NIO, NVIDIA, Mercedes-Benz, and WeRide among others are users of Tencent Intelligent Vehicle Cloud. In 2024, Tencent will further strengthen construction of AI foundation models.

Key Topics Covered:

1 Classification of Autonomous Driving (AD) Algorithms and Common Algorithm Models

1.1 AD System Classification and Software 2.0

1.2 Baidu AD Algorithm Development History

1.3 Tesla AD Algorithm Development History

1.4 Neural Network Model

1.5 Traditional AD AI Algorithms (Small Model)

1.6 Transformer and BEV (Foundation Model)

1.7 End-to-end Foundation Model Cases

2 Overview of AI Foundation Model and Intelligent Computing Center

2.1 AI Foundation Model

2.2 Application of AI Foundation Model in Automotive

2.3 Autonomous Driving (AD) Multimodal Basic Foundation Model

2.4 Intelligent Computing Center

3 Tesla Algorithm and Foundation Model Analysis

3.1 Algorithm Fusion of CNN and Transformer

3.2 Transformer Turns 2D into 3D

3.3 Occupancy Network, Semantic Segmentation and Time-space Sequence

3.4 LaneGCN and Search Tree

3.5 Data Closed Loop and Data Engine

4 AI Algorithms and Foundation Model Providers

4.1 Haomo.ai

4.2 QCraft

4.3 Baidu

4.4 Inspur

4.5 SenseTime

4.6 Huawei

4.7 Unisound

4.8 iFLYTEK

4.9 AISpeech

4.10 Megvii Technology

4.11 Volcengine

4.12 Tencent Cloud

4.13 Other Companies

4.13.1 Banma Zhixing

4.13.2 ThunderSoft

4.13.3 Horizon Robotics’ End-side Deployment of Foundation Model

5 Foundation Model of OEMs

5.1 Xpeng Motor

5.2 Li Auto

5.3 Geely

5.4 BYD

5.5 GM

5.6 Changan Automobile

5.7 Other Auto Enterprises

5.7.1 GWM: All-round Layout of AI Foundation Model

5.7.2 Chery: EXEED STERRA ES Equipped with Cognitive Foundation Model

5.7.3 GAC

5.7.4 SAIC-GM-Wuling

5.7.5 Mercedes-Benz

5.7.6 Volkswagen

5.7.7 Stellantis

5.7.8 PSA

6 Application Trends of Sora and AI Foundation Model in Automotive

6.1 Analysis of Sora Text-to-Video Foundation Model

6.2 Explanation of Sora’s Underlying Algorithm Architecture

6.3 Generative World Model and Intelligent Vehicle Industry

6.4 Application Trends of AI Foundation Model in Automotive

6.5 AI Foundation Model Requirements for Chips

For more information about this report visit https://www.researchandmarkets.com/r/8h7509

About ResearchAndMarkets.com

ResearchAndMarkets.com is the world’s leading source for international market research reports and market data. We provide you with the latest data on international and regional markets, key industries, the top companies, new products and the latest trends.

Contacts

ResearchAndMarkets.com

Laura Wood, Senior Press Manager

press@researchandmarkets.com
For E.S.T Office Hours Call 1-917-300-0470

For U.S./ CAN Toll Free Call 1-800-526-8630

For GMT Office Hours Call +353-1-416-8900

Alex

Next Zai Lab Statement on Executive Management Team’s Agreement on Share Activities »

Previous « Global and China Automotive Smart Surface Research 2023 Featuring 7 Tier 1 Automobile Smart Surface Companies, 22 Core Enterprises in the Supply Chain and Comparison of 14 OEM Smart Surface Models - ResearchAndMarkets.com

Published by

Alex

2 years ago

SEED Medical Launches “Christmas Gift of Health: Year-End Body Check Festival”

HONG KONG SAR - Media OutReach Newswire - 15 December 2025 - As the Christmas…

40 minutes ago

News

HK Pung Saeng Taekwondo Hosts “HK Pung Saeng TaekwonFest 2025” Concludes Successfully

Gathering Athletes from Six Regions to Showcase Taekwondo Spirit and Promote International Exchange and Youth…

50 minutes ago

News

Super Moments in Focus: OPPO Announces Global Winners of the 2025 Photography Awards

SHENZHEN, CHINA - Media OutReach Newswire – 15 December 2025 - OPPO today announced the…

54 minutes ago

News

Relief Therapeutics and NeuroX Complete Business Combination and Form MindMaze Therapeutics

GENEVA, SWITZERLAND - EQS Newswire - 15 December 2025 - MindMaze Therapeutics Holding SA (SIX:…

1 hour ago

News

Halogen Capital Completes RM13.3 Million Funding Round, Led by Kenanga Investment Bank and 500 Global, to Drive Digital Asset Innovation in Malaysia

Kenanga leads the funding round, with participation from global and regional investors.KUALA LUMPUR, MALAYSIA -…

2 hours ago

News

Philips Evnia Joins Forces with Sonic Racing: CrossWorlds to Bring Gamers the Perfect Fusion of Speed and Visual Brilliance

HONG KONG SAR - Media OutReach Newswire - 15 December 2025 - Premium gaming monitor…

5 hours ago