Coding – Tenosr's notebook

timeScale vs fixedDeltaTime

‘Time.timeScale‘ is a property in Unity that affects the scale at which time passes in the game. This property can be used to create effects such as slow […]

Fixed update with Physics.Simulate in Unity

In English, Unity

When we set ‘Physics.simulationMode’ to ‘Script’, we can manually control the fixed update using ‘Physics.Simulate’. Here, we observe the differences when ‘Physics.simulationMode’ is placed at the top or […]

OpenMP and MPI

Coding, In English

OpenMP(Open Multi-Processing) is an application programming interface (library) that supports multi-platform shared-memory parallel programming in C, C++ and Fortran on all architectures, including Unix and Windows platform. It […]

mean rewrad in SubprocVecEnv

Coding, Reinforcement Learning

In Stable Baseline3, when using environments like ‘SubprocVecEnv’ for parallel environment management, the mean reward isn’t displayed by default during the training phase. This is because ‘SubprocVecEnv’ runs […]

二叉树的遍历

Coding, leetcode

前序、中序、后序遍历二叉树，迭代法。

Llama 重写日志[未完…]

Coding, LLM

很遗憾重写失败了，官方inference使用了meta之前的一个包fairscale，很麻烦，后面有大段的空闲时间的话再捡起来。

GBDT核心源码解析

Coding, Machine Learning

【文章发布的比较早，新版sklearn已经使用Rust重写了，只能用来凑热闹了】 sklearn中对GBDT的实现是完全遵从论文 Greedy Function Approximation的，我们一起来看一下是怎么实现的。GBDT源码最核心的部分应该是对Loss Function的处理，因为除去Loss部分的代码其他的都是非常直觉且标准的程序逻辑，反正我们就从sklearn对loss的实现开始看吧～～ Loss Function 的实现以二分类任务为例，loss采用Binomial Deviance，看这个loss很陌生，其实跟我们熟悉的negative log-likelihood / cross entropy 是一回事，因为是二分类问题嘛，模型最终输出其实就是$P(y=1|x)$，即样本$x$是正例的概率，我们把这个概率标记成$p(x)$，那么Binomial Deviance等于 $$\ell(y, F(x)) = -\left [ y\log(p(x)) + (1 – y)\log(1-p(x)) \right […]