Robotics
♻ ★ NMPCB: A Lightweight and Safety-Critical Motion Control Framework for Ackermann Mobile Robot
In multi-obstacle environments, real-time performance and safety in robot
motion control have long been challenging issues, as conventional methods often
struggle to balance the two. In this paper, we propose a novel motion control
framework composed of a Neural network-based path planner and a Model
Predictive Control (MPC) controller based on control Barrier function (NMPCB) .
The planner predicts the next target point through a lightweight neural network
and generates a reference trajectory for the controller. In the design of the
controller, we introduce the dual problem of control barrier function (CBF) as
the obstacle avoidance constraint, enabling it to ensure robot motion safety
while significantly reducing computation time. The controller directly outputs
control commands to the robot by tracking the reference trajectory. This
framework achieves a balance between real-time performance and safety. We
validate the feasibility of the framework through numerical simulations and
real-world experiments.
♻ ★ Efficient Manipulation-Enhanced Semantic Mapping With Uncertainty-Informed Action Selection
Service robots operating in cluttered human environments such as homes,
offices, and schools cannot rely on predefined object arrangements and must
continuously update their semantic and spatial estimates while dealing with
possible frequent rearrangements. Efficient and accurate mapping under such
conditions demands selecting informative viewpoints and targeted manipulations
to reduce occlusions and uncertainty. In this work, we present a
manipulation-enhanced semantic mapping framework for occlusion-heavy shelf
scenes that integrates evidential metric-semantic mapping with
reinforcement-learning-based next-best view planning and targeted action
selection. Our method thereby exploits uncertainty estimates from Dirichlet and
Beta distributions in the map prediction networks to guide both active sensor
placement and object manipulation, focusing on areas with high uncertainty and
selecting actions with high expected information gain. Furthermore, we
introduce an uncertainty-informed push strategy that targets occlusion-critical
objects and generates minimally invasive actions to reveal hidden regions by
reducing overall uncertainty in the scene. The experimental evaluation shows
that our framework enables to accurately map cluttered scenes, while
substantially reducing object displacement and achieving a 95% reduction in
planning time compared to the state-of-the-art, thereby realizing real-world
applicability.
♻ ★ Co-Design of Soft Gripper with Neural Physics
For robot manipulation, both the controller and end-effector design are
crucial. Soft grippers are generalizable by deforming to different geometries,
but designing such a gripper and finding its grasp pose remains challenging. In
this paper, we propose a co-design framework that generates an optimized soft
gripper's block-wise stiffness distribution and its grasping pose, using a
neural physics model trained in simulation. We derived a uniform-pressure
tendon model for a flexure-based soft finger, then generated a diverse dataset
by randomizing both gripper pose and design parameters. A neural network is
trained to approximate this forward simulation, yielding a fast, differentiable
surrogate. We embed that surrogate in an end-to-end optimization loop to
optimize the ideal stiffness configuration and best grasp pose. Finally, we
3D-print the optimized grippers of various stiffness by changing the structural
parameters. We demonstrate that our co-designed grippers significantly
outperform baseline designs in both simulation and hardware experiments. More
info: http://yswhynot.github.io/codesign-soft/
♻ ★ NetRoller: Interfacing General and Specialized Models for End-to-End Autonomous Driving
Integrating General Models (GMs) such as Large Language Models (LLMs), with
Specialized Models (SMs) in autonomous driving tasks presents a promising
approach to mitigating challenges in data diversity and model capacity of
existing specialized driving models. However, this integration leads to
problems of asynchronous systems, which arise from the distinct characteristics
inherent in GMs and SMs. To tackle this challenge, we propose NetRoller, an
adapter that incorporates a set of novel mechanisms to facilitate the seamless
integration of GMs and specialized driving models. Specifically, our mechanisms
for interfacing the asynchronous GMs and SMs are organized into three key
stages. NetRoller first harvests semantically rich and computationally
efficient representations from the reasoning processes of LLMs using an early
stopping mechanism, which preserves critical insights on driving context while
maintaining low overhead. It then applies learnable query embeddings,
nonsensical embeddings, and positional layer embeddings to facilitate robust
and efficient cross-modality translation. At last, it employs computationally
efficient Query Shift and Feature Shift mechanisms to enhance the performance
of SMs through few-epoch fine-tuning. Based on the mechanisms formalized in
these three stages, NetRoller enables specialized driving models to operate at
their native frequencies while maintaining situational awareness of the GM.
Experiments conducted on the nuScenes dataset demonstrate that integrating GM
through NetRoller significantly improves human similarity and safety in
planning tasks, and it also achieves noticeable precision improvements in
detection and mapping tasks for end-to-end autonomous driving. The code and
models are available at https://github.com/Rex-sys-hk/NetRoller .
comment: This work has been submitted to the IEEE for possible publication