Weihao Kong |
I am currently a Research Scientist at Google. Previously I was a postdoctoral researcher at University of Washington working with Sham M. Kakade. I got my PhD in 2019 from Stanford University advised by Gregory Valiant. My thesis can be found here. I got my bachelor degree from Shanghai Jiao Tong University and did undergraduate research with Wu-Jun Li.
Research interests:
- Differential privacy, robust machine learning, time series forecasting.
CV | · | Teaching | · | Talks |
Publications
-
Estimating Optimal Policy Value in Linear Contextual Bandits Beyond Gaussianity
Jonathan Lee, Weihao Kong, Aldo Pacchiano, Vidya Muthukumar, Emma Brunskill.
TMLR 2024 -
Transformers can optimally learn regression mixture models
Reese Pathak, Rajat Sen, Weihao Kong, Abhimanyu Das.
To appear in ICLR 2024 -
A Combinatorial Approach to Robust PCA
Weihao Kong*, Mingda Qiao*, Rajat Sen*.
ITCS 2024 -
Label Robust and Differentially Private Linear Regression: Computational and Statistical Efficiency
Xiyang Liu, Prateek Jain, Weihao Kong, Sewoong Oh, Arun Suggala.
NeurIPS 2023 -
Long-term Forecasting with TiDE: Time-series Dense Encoder
Abhimanyu Das*, Weihao Kong*, Andrew Leach*, Shaan Mathur*, Rajat Sen*, Rose Yu*.
TMLR 2023 -
Efficient List-Decodable Regression using Batches
Abhimanyu Das*, Ayush Jain*, Weihao Kong*, Rajat Sen*.
ICML 2023 -
Dirichlet Proportions Model for Hierarchically Coherent Probabilistic Forecasting
Abhimanyu Das*, Weihao Kong*, Biswajit Paria* , Rajat Sen*.
UAI 2023 -
Blackbox optimization of unimodal functions
Ashok Cutkosky*, Abhimanyu Das*, Weihao Kong*, Chansoo Lee* , Rajat Sen*.
UAI 2023 -
Trimmed Maximum Likelihood Estimation for Robust Learning in Generalized Linear Models
Pranjal Awasthi*, Abhimanyu Das*, Weihao Kong*, Rajat Sen*.
NeurIPS 2022Trimmed MLE achieves near-optimal error rate for Generalized Linear Models under adversarial corruptions.
-
DP-PCA: Statistically Optimal and Differentially Private PCA
Xiyang Liu, Weihao Kong, Prateek Jain, Sewoong Oh.
NeurIPS 2022First differentially private PCA algorithm with $\tilde{O} (d)$ sample complexity for sub-Gaussian data.
-
Fisher-Pitman permutation tests based on nonparametric Poisson mixtures with application to single cell genomics
Zhen Miao, Weihao Kong, Ramya Korlakai Vinayak, Wei Sun, Fang Han.
Journal of the American Statistical Association, 2022We prove nonparametric MLE achieves minimax optimal rate for nonparametric mixture of Poisson distributions.
-
Differential Privacy and Robust Statistics in High Dimensions
Xiyang Liu, Weihao Kong, Sewoong Oh.
COLT 2022A general receipe for minimax optimal algorithm (exponential time) on differentially private and robust mean estimation, linear regression, covariance estimation, PCA.
-
Robust and Differentially Private Mean Estimation
Xiyang Liu, Weihao Kong, Sham M. Kakade, Sewoong Oh.
NeurIPS 2021First differentially private robust mean estimation algorithm for high dimensional distributions.
-
SPECTRE: Defending Against Backdoor Attacks Using Robust Statistics
Jonathan Hayase, Weihao Kong, Raghav Somani, Sewoong Oh.
ICML 2021. -
Online Model Selection for Reinforcement Learning with Function Approximation
Jonathan N. Lee, Aldo Pacchiano, Vidya Muthukumar, Weihao Kong, Emma Brunskill.
AISTATS 2021A meta-algorithm that adapts to the optimal model complexity with regret bound $\tilde{O}(d_*T^{2/3})$.
-
Robust Meta-learning for Mixed Linear Regression with Small Batches [Video]
Weihao Kong, Raghav Somani, Sham M. Kakade, Sewoong Oh.
NeurIPS 2020Smooth time/sample complexity trade-off between the number of tasks and the dataset size of each task. Algorithmic contribution includes an optimal robust PCA algorithm.
-
Meta-Learning for Mixed Linear Regression [Video]
Weihao Kong, Raghav Somani, Zhao Song, Sham M. Kakade, Sewoong Oh.
ICML 2020First poly time/sample complexity algorithm for $k$-mixture linear regression by utilizing a small set of tasks each with $\tilde{O}(\sqrt{k})$ examples.
-
Optimal Estimation of Change in a Population of Parameters
Ramya Korlakai Vinayak, Weihao Kong, Sham M. Kakade.
ManuscriptEstimate the distribution of treatment effects.
-
Sublinear Optimal Policy Value Estimation in Contextual Bandits [Video]
Weihao Kong, Gregory Valiant, Emma Brunskill.
AISTATS 2020.Estimate optimal policy value of contextual bandits with $K$ disjoint $d$-dimensional arms in $\tilde{O} (\sqrt{d}K)$ rounds, minimax optimal up to log factors.
-
Maximum Likelihood Estimation for Learning Populations of Parameters
Ramya Korlakai Vinayak, Weihao Kong, Gregory Valiant, Sham M. Kakade.
ICML 2019.$n$ different coins, each tossed $t$ times, estimate distribution of coin biases with Wasserstein error $O(1/\sqrt{t\log n})$ when $n<2^t$, minimax optimal.
-
Efficient Algorithms and Lower Bounds for Robust Linear Regression
Ilias Diakonikolas*, Weihao Kong*, Alistair Stewart*.
SODA 2019Solve $\epsilon$ corrupted linear regression up to $\tilde{O}(\epsilon)$ error under gaussianity assumption, minimax optimal, previous work achieves $\sqrt{\epsilon}$.
-
Estimating Learnability in the Sublinear Data Regime [Code]
Weihao Kong, Gregory Valiant.
NeurIPS 2018Estimate optimal prediction error of linear regression/classification with $O(\sqrt{d})$ examples, minimax optimal.
-
Approximating the Spectrum of a Graph [Code][Video]
David Cohen-Steiner*, Weihao Kong*, Christian Sohler*, Gregory Valiant*.
KDD 2018.Approximate graph Laplacian specturm in constant time.
-
Recovering Structured Probability Matrices
Qingqing Huang*, Sham M. Kakade*, Weihao Kong*, Gregory Valiant*.
ITCS 2018. -
Learning Populations of Parameters [Video]
Kevin Tian, Weihao Kong, Gregory Valiant.
NeurIPS 2017.$n$ different coins, each tossed $t$ times, estimate distribution of coin biases with Wasserstein error $O(1/{t})$ when $n\ge 2^t$, minimax optimal.
-
Spectrum Estimation from Samples [Code]
Weihao Kong, Gregory Valiant.
Annals of Statistics, 2017.First algorithm consistently estimates eigenvalues of covariance with $o(d)$ samples.
-
Optimal Allocation for Chunked-Reward Advertising
Weihao Kong, Jian Li, Tao Qin, Tie-Yan Liu.
WINE 2013. -
Isotropic Hashing [Code]
Weihao Kong, Wu-Jun Li.
NeurIPS 2012.[Poster] -
Manhattan Hashing for Large-Scale Image Retrieval [Code]
Weihao Kong, Wu-Jun Li, Minyi Guo.
SIGIR 2012. -
Double-bit quantization for hashing [Code]
Weihao Kong, Wu-Jun Li.
AAAI 2012.