Photo 

Weihao Kong
Research Scientist
Google
Email: kweihao at gmail dot com

I am currently a Research Scientist at Google. Previously I was a postdoctoral researcher at University of Washington working with Sham M. Kakade. I got my PhD in 2019 from Stanford University advised by Gregory Valiant. My thesis can be found here. I got my bachelor degree from Shanghai Jiao Tong University and did undergraduate research with Wu-Jun Li.

Research interests:


CV · Teaching · Talks

Publications

  1. Estimating Optimal Policy Value in Linear Contextual Bandits Beyond Gaussianity
    Jonathan Lee, Weihao Kong, Aldo Pacchiano, Vidya Muthukumar, Emma Brunskill.
    TMLR 2024
  2. Transformers can optimally learn regression mixture models
    Reese Pathak, Rajat Sen, Weihao Kong, Abhimanyu Das.
    To appear in ICLR 2024
  3. A Combinatorial Approach to Robust PCA
    Weihao Kong*, Mingda Qiao*, Rajat Sen*.
    ITCS 2024
  4. Label Robust and Differentially Private Linear Regression: Computational and Statistical Efficiency
    Xiyang Liu, Prateek Jain, Weihao Kong, Sewoong Oh, Arun Suggala.
    NeurIPS 2023
  5. Long-term Forecasting with TiDE: Time-series Dense Encoder
    Abhimanyu Das*, Weihao Kong*, Andrew Leach*, Shaan Mathur*, Rajat Sen*, Rose Yu*.
    TMLR 2023
  6. Efficient List-Decodable Regression using Batches
    Abhimanyu Das*, Ayush Jain*, Weihao Kong*, Rajat Sen*.
    ICML 2023
  7. Dirichlet Proportions Model for Hierarchically Coherent Probabilistic Forecasting
    Abhimanyu Das*, Weihao Kong*, Biswajit Paria* , Rajat Sen*.
    UAI 2023
  8. Blackbox optimization of unimodal functions
    Ashok Cutkosky*, Abhimanyu Das*, Weihao Kong*, Chansoo Lee* , Rajat Sen*.
    UAI 2023
  9. Trimmed Maximum Likelihood Estimation for Robust Learning in Generalized Linear Models
    Pranjal Awasthi*, Abhimanyu Das*, Weihao Kong*, Rajat Sen*.
    NeurIPS 2022

    Trimmed MLE achieves near-optimal error rate for Generalized Linear Models under adversarial corruptions.

  10. DP-PCA: Statistically Optimal and Differentially Private PCA
    Xiyang Liu, Weihao Kong, Prateek Jain, Sewoong Oh.
    NeurIPS 2022

    First differentially private PCA algorithm with $\tilde{O} (d)$ sample complexity for sub-Gaussian data.

  11. Fisher-Pitman permutation tests based on nonparametric Poisson mixtures with application to single cell genomics
    Zhen Miao, Weihao Kong, Ramya Korlakai Vinayak, Wei Sun, Fang Han.
    Journal of the American Statistical Association, 2022

    We prove nonparametric MLE achieves minimax optimal rate for nonparametric mixture of Poisson distributions.

  12. Differential Privacy and Robust Statistics in High Dimensions
    Xiyang Liu, Weihao Kong, Sewoong Oh.
    COLT 2022

    A general receipe for minimax optimal algorithm (exponential time) on differentially private and robust mean estimation, linear regression, covariance estimation, PCA.

  13. Robust and Differentially Private Mean Estimation
    Xiyang Liu, Weihao Kong, Sham M. Kakade, Sewoong Oh.
    NeurIPS 2021

    First differentially private robust mean estimation algorithm for high dimensional distributions.

  14. SPECTRE: Defending Against Backdoor Attacks Using Robust Statistics
    Jonathan Hayase, Weihao Kong, Raghav Somani, Sewoong Oh.
    ICML 2021.
  15. Online Model Selection for Reinforcement Learning with Function Approximation
    Jonathan N. Lee, Aldo Pacchiano, Vidya Muthukumar, Weihao Kong, Emma Brunskill.
    AISTATS 2021

    A meta-algorithm that adapts to the optimal model complexity with regret bound $\tilde{O}(d_*T^{2/3})$.

  16. Robust Meta-learning for Mixed Linear Regression with Small Batches [Video]
    Weihao Kong, Raghav Somani, Sham M. Kakade, Sewoong Oh.
    NeurIPS 2020

    Smooth time/sample complexity trade-off between the number of tasks and the dataset size of each task. Algorithmic contribution includes an optimal robust PCA algorithm.

  17. Meta-Learning for Mixed Linear Regression [Video]
    Weihao Kong, Raghav Somani, Zhao Song, Sham M. Kakade, Sewoong Oh.
    ICML 2020

    First poly time/sample complexity algorithm for $k$-mixture linear regression by utilizing a small set of tasks each with $\tilde{O}(\sqrt{k})$ examples.

  18. Optimal Estimation of Change in a Population of Parameters
    Ramya Korlakai Vinayak, Weihao Kong, Sham M. Kakade.
    Manuscript

    Estimate the distribution of treatment effects.

  19. Sublinear Optimal Policy Value Estimation in Contextual Bandits [Video]
    Weihao Kong, Gregory Valiant, Emma Brunskill.
    AISTATS 2020.

    Estimate optimal policy value of contextual bandits with $K$ disjoint $d$-dimensional arms in $\tilde{O} (\sqrt{d}K)$ rounds, minimax optimal up to log factors.

  20. Maximum Likelihood Estimation for Learning Populations of Parameters
    Ramya Korlakai Vinayak, Weihao Kong, Gregory Valiant, Sham M. Kakade.
    ICML 2019.

    $n$ different coins, each tossed $t$ times, estimate distribution of coin biases with Wasserstein error $O(1/\sqrt{t\log n})$ when $n<2^t$, minimax optimal.

  21. Efficient Algorithms and Lower Bounds for Robust Linear Regression
    Ilias Diakonikolas*, Weihao Kong*, Alistair Stewart*.
    SODA 2019

    Solve $\epsilon$ corrupted linear regression up to $\tilde{O}(\epsilon)$ error under gaussianity assumption, minimax optimal, previous work achieves $\sqrt{\epsilon}$.

  22. Estimating Learnability in the Sublinear Data Regime [Code]
    Weihao Kong, Gregory Valiant.
    NeurIPS 2018

    Estimate optimal prediction error of linear regression/classification with $O(\sqrt{d})$ examples, minimax optimal.

  23. Approximating the Spectrum of a Graph [Code][Video]
    David Cohen-Steiner*, Weihao Kong*, Christian Sohler*, Gregory Valiant*.
    KDD 2018.

    Approximate graph Laplacian specturm in constant time.

  24. Recovering Structured Probability Matrices
    Qingqing Huang*, Sham M. Kakade*, Weihao Kong*, Gregory Valiant*.
    ITCS 2018.
  25. Learning Populations of Parameters [Video]
    Kevin Tian, Weihao Kong, Gregory Valiant.
    NeurIPS 2017.

    $n$ different coins, each tossed $t$ times, estimate distribution of coin biases with Wasserstein error $O(1/{t})$ when $n\ge 2^t$, minimax optimal.

  26. Spectrum Estimation from Samples [Code]
    Weihao Kong, Gregory Valiant.
    Annals of Statistics, 2017.

    First algorithm consistently estimates eigenvalues of covariance with $o(d)$ samples.

  27. Optimal Allocation for Chunked-Reward Advertising
    Weihao Kong, Jian Li, Tao Qin, Tie-Yan Liu.
    WINE 2013.
  28. Isotropic Hashing [Code]
    Weihao Kong, Wu-Jun Li.
    NeurIPS 2012.[Poster]
  29. Manhattan Hashing for Large-Scale Image Retrieval [Code]
    Weihao Kong, Wu-Jun Li, Minyi Guo.
    SIGIR 2012.
  30. Double-bit quantization for hashing [Code]
    Weihao Kong, Wu-Jun Li.
    AAAI 2012.
(asterisk indicates alphabetical authorship)