image: Socially aware AI helps autonomous vehicles weave through crowds without collisions.
Credit: Zihan Jiang/Tongji University, Weidong Zhang/Shanghai Jiao Tong University,Henan University
Researchers from Tongji University and Shanghai Jiao Tong University have developed a socially aware prediction-to-control pipeline that lets autonomous vehicles safely navigate dense crowds by anticipating multiple ways pedestrians might move. Instead of betting on a single forecast, their system combines a Social GAN trajectory predictor with a real-time Model Predictive Control (MPC) planner, treating each predicted path as a moving obstacle. In dynamic crowd simulations, the integrated Social GAN+MPC controller achieved zero safety violations and maintained comfortable motion, all while meeting strict real-time computing limits—offering a practical route toward zero-collision autonomous driving in busy urban environments.
Autonomous vehicles will not earn public trust if they hesitate at every crosswalk or, worse, squeeze past pedestrians in ways that feel unsafe. In dense urban spaces, self-driving systems must make split-second decisions while surrounded by people who may suddenly slow down, speed up, or change direction.
In a new study published in Robot Learning, researchers from Tongji University and Shanghai Jiao Tong University present a socially aware prediction-to-control framework that helps autonomous vehicles weave through crowds without collisions. By tightly coupling a Social Generative Adversarial Network (Social GAN) with a constraint-aware Model Predictive Control (MPC) planner, the team shows that it is possible to reason over multiple plausible futures and still meet strict real-time requirements.
Traditional motion planners often assume that pedestrians behave in simple, predictable ways. In practice, people negotiate for space, join or split groups, pause to look at their phones, or yield to others at the last moment. These behaviors are inherently uncertain and multi-modal: at any given instant, several different paths may be equally likely.
For computational reasons, many existing systems compress this rich uncertainty into a single “average” forecast or a conservative worst-case path. While this simplification makes the planner easier to implement, it can also make the vehicle overly cautious—stopping too often—or dangerously overconfident if the realized behavior deviates from the single predicted future.
The new framework positions itself exactly at the interface between learning-based prediction and optimization-based control.
On the prediction side, a Social GAN model takes short histories of pedestrian motion and generates multiple socially consistent future trajectories for each person in the scene. These samples capture distinct possibilities such as “keeps walking straight,” “slows down,” or “yields and turns,” instead of collapsing everything into a single path.
On the control side, an MPC planner computes the vehicle’s next steering and acceleration commands over a short time horizon while respecting physical limits and comfort constraints. The key innovation is to treat each Social GAN trajectory sample as a time-varying dynamic obstacle inside the MPC optimization problem. Rather than smoothing them into a probability map or averaging them away, the planner explicitly considers a curated set of diverse futures when choosing its own path.
“In simple terms, the predictor imagines several socially reasonable futures for every pedestrian, and the controller picks a path that stays safe under all of them,” says author Zihan Jiang. “This lets the vehicle negotiate crowded spaces more like a cautious human driver, without needing an excessively complex controller.”
To test the framework, the team first validated Social GAN’s predictive quality on standard ETH/UCY pedestrian datasets, where it outperformed classic LSTM-based baselines in average and final displacement error. They then placed a robot agent into simulated dense-crowd scenarios and compared two control strategies:
1. A simple reactive “emergency-stop” policy that brakes when pedestrians get too close, and
2. The proposed Social GAN-assisted MPC controller.
Both controllers used the same pedestrian predictions, but only the second treated the multiple trajectory samples as structured input to the optimization problem.
The difference in safety was stark. In repeated simulations, the Social GAN+MPC controller achieved zero safety violations, maintaining an average clearance of about 0.94 meters to the nearest pedestrian—slightly larger than the reactive baseline—while still reaching the goal. The safer behavior came at the cost of only a small increase in travel time (about 0.8 seconds longer) and a modest reduction in path efficiency due to more cautious detours.
Because the controller makes earlier, proactive adjustments, it uses higher acceleration and jerk levels than the simple baseline. However, the study reports that these values remain within commonly accepted comfort limits, indicating that passengers would likely perceive the maneuvers as firm but not aggressive.
Crucially, the entire prediction-to-control loop remains fast enough for real vehicles. Across 40 full cycles, the average end-to-end latency was around 209 milliseconds, with the slowest cycle still comfortably below a 400-millisecond deadline. Social GAN itself required only a few milliseconds per update, while the MPC solver dominated the computation but stayed within typical 100–200 millisecond control intervals.
This paper was published in Robot Learning. Jiang Z, Zhu H, Liu R, Hu X, Zhang W. Social GAN guided multimodal trajectory prediction and MPC for autonomous driving. Robot Learn. 2025(2):0010, https://doi.org/10.55092/rl20250010.
DOI: 10.55092/rl20250010
Journal
Robot Learning
Method of Research
Computational simulation/modeling
Subject of Research
Not applicable
Article Title
Social GAN guided multimodal trajectory prediction and MPC for autonomous driving
Article Publication Date
26-Nov-2025