Chuyan Zhou
Junior Student in CS Major
ShanghaiTech University
I’m currently an undergraduate student at ShanghaiTech University majoring in Computer Science & Technology. I am also an undergraduate researcher in the field of NLP, with a focus on LLMs and related areas. Here is my Curriculum Vitae.
Research Interest
Currently, I am most interested in these research topics:
- Efficient LLMs, such as
- Parallel Decoding (Speculative/Jacobi/…)
- Quantization,
- KV Cache Compression/Pruning,
- Distillation…
- Reinforcement Learning & Post-training for LLMs
- Diffusion-related Methods for LLMs (for NAT-ish Decoding)
- Combination of Connectionist and Probabilistic/Symbolic Methods for NLP
- AI for Science, especially AI for Biology where the bold items are which I have been working on. Generally, I am interested in researches in fields involving Natural Language Processing, Machine Learning Systems & Theories, Reinforcement Learning, AI for Biology and Computer Vision.
Language Skills
- English: Fluent, TOEFL iBT 101/120 (2023.08.26), GRE 331/340 (2025.03.08, V161/Q170)
- Japanese: Fluent, JLPT N2 170/180 (2024.07)
- Mandarin: Native
I post blogs here for research, notes and random things such as my traveling:
Photo of Tateyama Mountains. Shown in Toyama City, Toyama Prefecture, Japan, in 2025.01.05.
Publications
Coming soon...
Recent Posts

Paper Reading: Pushdown Layers
The paper reading note for Pushdown Layers.
Facial Keypoint Detection by Regression & Pixelwise Classification
From IMM Face Database (regression) and iBUG dataset (Gaussian heatmap & Pixelwise classification). (CS280 FA22 Project 5 & CS180 Final Project 1/3)
Rectified Flow, explained
A simple explanation for the Rectified Flow framework.CS180: Project Webpage Catalog
The Catalog of Project Showcase Webpages for 2024 Fall UC Berkeley COMPSCI 180 Computer VisionEE229 Lecture Note 2
Definition of Many EntropiesResearch Experience
Undergraduate Researcher
2023.10 - Present
-
(Semantic Dependency Parsing w/ ML) (2025.2 - Present)
- Built some baselines and conducted experiments for the baseline model.
- Worked in a part of paper writing.
- Coming soon…
-
Other Contributions
- (2023.12-2025.1, as a main contributor) Developed a binary classification and regression model to predict packaging efficiency leveraging ESM Encoders for synthesis (inpainting) of gene sequence insertions into AAV2 caspid proteins—with the motif for gene therapy vectors.
- (2024.9-2025.2, as the primary contributor) Investigated the feasibility of a Jacobi decoding method in a continuous fashion; further work (e.g. parallel continuous CoT) is under investigation.
More Projects
Reconstruction and Re-evaluation of SFCNN for PLA Scoring
2025.4 - Present
- (2025.4) Constructed a PyTorch implementation of the model architecture, training, and evaluation of SFCNN (Scoring Function 3D Convolutional Neural Network) for protein-ligand binding affinity prediction.
- (2025.4-6, expected) Developed a novel benchmark for similar models to allow inference directly on the 3D structure of protein-ligand complexes instead of on decoupled protein and ligand structures.
LLM-powered Lecture Generation
2024.8 - 2024.12
- (2024.8-9) Independently developed the backend framework of the lecture generation pipeline using FastAPI as a deployable web service. This backend includes asynchronous task execution via multithreading, task management via an API powered by Redis databases, and a metadata system for managing generated data.
- (2024.9-10) Worked as the main developer to integrate the respective model components on the backend framework.
- (2024.10-11) Developed an additional LLM-powered QA agent based on the backend that interacts with LLMs using a long context of generated lectures and a RAG system to dynamically index grounding sources (e.g., textbooks).