I am a researcher with a focus on high-dimensional statistical learning, large-scale distributed optimization, reproducible research, and data science.
I also have abundant knowledge on reinforcement learning, deep learning, operations research, economics, finance, sociology, and many other domains.
Meanwhile, I provide data science innovation consulting, designing for my clients tailored solutions to problems that cannot be solved with regular methods.
Get in touch with me if you want to know more about my research or are interested in my expertise.
- Email: contact (at) zhengwenjie (dot) net
- Github: https://github.com/WenjieZ
- Test the randomness of card games
- Monte Carlo Tree Search
- Translate Julia blog posts into Chinese
- Tell me your name, so that I know how old you are and where you are from
- The most common names of the newborns in France
- Time series cross validation (Python package)
- HR assisting system
- Distributed Frank–Wolfe
- Private and Decentralized recommender systems
- Clustering of flat steel surface defects
- Calibration of Levy process
- Next-generation distributed optimization
- Fundamental machine learning questions
- Pandas 0.24.2
- Release process
- Spellbook flowchart
- Herb and Spice
- Cooking Oil
- In, at, on
- Banks in China
- The Doll Knight
- Learning Low-rank Matrices Distributedly without Factorization
- Pokémon GO, Metagaming, and Data Science
Papers and reports
- $hv$-Block Cross Validation is not a BIBD: a Note on the Paper by Jeff Racine (2000)
- A Distributed Frank–Wolfe Framework for Learning Low-Rank Matrices with the Trace Norm
- Toward a Better Understanding of Leaderboards
- Matrix Factorization Method for Decentralized Recommender Systems
- Two Differentially Private Rating Collection Mechanisms for Recommender Systems
- Pokémon GO – using Nash Equlibrium to build minimalist PvE lists for Pokémon GO.
- Neurons – a concise Python package implementing a Neural Network without hidden layers.
- Breakout – a simple game on Windows, written in C++.
- From 2048 to ABCD – a 2048-like game, open-source, written in C++.
- Multithreading of C++ 11– a document testing C++11 multithreading technology.
- Park – a Python package of PySpark offline testing environment.
- Vocabulary Memory – an Excel file helping you memorize vocabulary. You can also customize it to memorize other things. Should enable the macro.
- Fate_GrandOrder – an IPython notebook to help you visualize the stats growth curves in Fate / Grand Order.