Sitemap

A list of all the posts and pages found on the site. For you robots out there is an XML version available for digesting as well.

Page Not Found

Page not found. Your pixels are in another canvas.

About me

Website Redesign Implementation Plan

Jupyter notebook markdown generator

Posts

What’s in Pass@K?

11 minute read

Published: January 30, 2026

Pass@k is ubiquitous in evaluating reasoning models, but the metric is more subtle than it appears. Computing it correctly requires the unbiased estimator, and the nonlinearity of pass@k means it effectively upweights hard problems compared to pass@1.

Implementing Process Rewards in VeRL

4 minute read

Published: January 10, 2026

Using process rewards in VeRL requires advantage estimators that preserve token-level structure. Most standard algorithms collapse rewards to scalars, defeating the purpose of fine-grained credit assignment.

Understanding Length Dynamics in RL Training

37 minute read

Published: December 21, 2025

An empirical investigation into what drives output length growth during RL training, revealing that dataset difficulty composition is the primary driver behind the ‘overthinking’ phenomenon.

(Zoey) Sha Li

Sitemap

Pages

Page Not Found

Archive Layout with Content

Blog

Posts by Category

Posts by Collection

CV

Markdown

Page not in menu

Misc

Page Archive

Portfolio

Publications/Projects

Publications

Sitemap

Posts by Tags

Talk map

Talks and presentations

Teaching

Terms and Privacy Policy

Blog posts

Website Redesign Implementation Plan

Jupyter notebook markdown generator

Posts

What’s in Pass@K?

Implementing Process Rewards in VeRL

Understanding Length Dynamics in RL Training

portfolio

Portfolio item number 1

Portfolio item number 2

publications