Projects

Learning Pushing Dynamics for Arbitrary 2D Rigid Bodies

Art Boyarov*, Zichen Zhang*.

We study the problem of learning the pushing dynamics of arbitrary 2D rigid bodies, developing neural network models trained on simulated data collected with a Franka Panda robot. By comparing a shallow MLP to a deeper point-cloud-inspired network, we show that the deeper model better captures the complex motion dynamics of different 2D shapes. Using the learned models in a Model Predictive Path Integral (MPPI) controller, we achieve closed-loop pushing and obstacle avoidance across diverse 2D rigid bodies.

<em>Simpler is Better:</em> Finding the Best Reward Function in Long Chain-of-Thought Reinforcement Learning for Small Language Models

Luning Wang*, Zichen Zhang*, Junkuan Liu*.

We study three types of reward functions (normal, cosine, and dynamic) for long chain-of-thought reinforcement learning in Small Language Models and find that the simple normal reward consistently outperforms more complex designs, suggesting that simpler rewards are sufficient for eliciting reasoning in smaller models.

VTMo: Unified Visuo-Tactile Transformer Encoder with Mixture-of-Modality-Experts

Zichen Zhang, Peihao Li, Yuan Cheng.

We introduce VTMo, a modular Vision-Touch Transformer encoder that unifies dual-encoder flexibility with fusion-encoder accuracy through a shared self-attention mechanism and modality-specific or cross-modal experts. VTMo supports image-only, touch-only, and vision-touch fusion tasks, offering versatility for speed or accuracy. Our method achieves competitive performance on the Image-to-Touch Retrieval task while reducing training time and computational complexity.

Babysitting a Small Language Model through One-Step Tree-of-Thoughts Knowledge Distillation

Anurag Renduchintala*, Adi Mahesh*, Zichen Zhang*, Zimo Si*, Shangjun Meng*, Samuel Fang*.

We introduce the One-Step Tree-of-Thoughts framework, a simplified prompting method that distills multi-step reasoning into a single structured prompt, and demonstrate how knowledge distillation can transfer this reasoning capability from Large Language Models to Small Language Models with far fewer parameters. The distilled model shows significant improvements in reasoning performance on Game of 24, beating GPT-4o and GPT-4.

MIA-Sort: Multiplex Chromatin Interaction Analysis by Efficiently Sorting Chromatin Complexes

Zichen Zhang, Minji Kim.

MIA-Sort is a Python bioinformatics tool for efficiently extracting and sorting chromatin complexes from large datasets like Hi-C and Pore-C, enabling researchers to analyze chromatin loops, stripes, jets, and hubs to study loop extrusion.

Zichen "Charlie" Zhang, Minji Kim
