You can reach me by email: linkai0508@gmail.com
🤓
CMU ECE | RL & MLsys
Highlights
- Pro
Pinned Loading
-
rl-bandits-lab/BOFormer
rl-bandits-lab/BOFormer Public[ICLR 2025] BOFormer: Learning to Solve Multi-Objective Bayesian Optimization via Non-Markovian RL
Jupyter Notebook 11
-
Trajectory-Transformer-for-Quatitative-Trading
Trajectory-Transformer-for-Quatitative-Trading PublicNYCU Intro2AI Final Project
-
-
RL-Align/RL-Kernel
RL-Align/RL-Kernel PublicHigh-performance RL post-training infrastructure. Designed to achieve bitwise operator-level train-inference consistency across heterogeneous engines and extreme memory efficiency for GRPO, PPO, etc.
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.



