I focus on building next-generation LLM evaluation systems and large-scale, data-centric AI. I believe LLM evaluation should be reliable, efficient, and anchored in realistic real-world tasks, and that it must rest on a deep understanding of data. I contribute to the Seed Evaluation System at ByteDance Seed (some of my insights are demonstrated in the Seed1.8 Model Card).
Specific topics include:
- Reliable LLM Eval: We design new perturbation algorithms and future-prediction tasks with zero data contamination to reliably assess the true capabilities of agentic LLMs. Together with Prof. Mengdi Wang, our FutureX project, described by Elon Musk as the "Best Measure of Intelligence" (media), has received over 100 million views on X (Twitter). Following this line, we released CryptoBench for expert-level tasks in the cryptocurrency domain. We also designed LLM Swiss Round to give a holistic ranking of leading LLMs.
- Efficient LLM Eval: We design algorithms to evaluate LLM performance efficiently and accurately.
- Realistic LLM Eval: We use realistic, professional, and valuable tasks to benchmark LLMs' true value in assisting with daily work. Together with Prof. Hongseok Namkoong, our FinSearchComp has received over 20 million views on X (Twitter) and was recently adopted in the MiniMax-M2 and Kimi-K2-Thinking reports.
- LLM Calibration: We design Behaviorally Calibrated RL to calibrate LLMs' confidence estimates, enabling LLMs to say "I don't know".