In the updated Gymnasium environment interface, the distinction between “terminated” and “truncated” makes it clearer why an episode ended, which is useful for more nuanced reinforcement learning […]
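To make the split concrete, here is a minimal rollout sketch against the Gymnasium API (the CartPole environment and the random policy are just illustrative choices):

```python
import gymnasium as gym

env = gym.make("CartPole-v1")
obs, info = env.reset(seed=42)

done = False
while not done:
    action = env.action_space.sample()  # random policy, for illustration
    obs, reward, terminated, truncated, info = env.step(action)
    # terminated: the MDP itself ended (e.g. the pole fell over)
    # truncated: the episode was cut short externally (e.g. a time limit),
    # so bootstrapping a value estimate from `obs` can still make sense
    done = terminated or truncated
env.close()
```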
Max Heap Sort
A max-heap viewed as (a) a binary tree and (b) an array. The root of the tree is A[1], and given the index i of a node, there’s […]
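The index arithmetic the excerpt alludes to is easy to state in code; a small sketch (0-based Python lists, so CLRS's 1-based formulas shift by one):

```python
def parent(i):  # parent of node i (0-based indexing)
    return (i - 1) // 2

def left(i):    # left child of node i
    return 2 * i + 1

def right(i):   # right child of node i
    return 2 * i + 2

# Max-heap property: every non-root node is <= its parent.
A = [16, 14, 10, 8, 7, 9, 3, 2, 4, 1]  # the classic CLRS example heap
assert all(A[parent(i)] >= A[i] for i in range(1, len(A)))
```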
PRML Chapter 1
1.1 Example: Polynomial Curve Fitting
Now suppose that we are given a training set comprising $N$ observations of $x$, written $\textbf{x} = (x_1, \dots, x_N)^{T}$, together with corresponding […]
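For reference, the setup this excerpt leads into fits an order-$M$ polynomial $y(x, \textbf{w})$ to targets $\textbf{t} = (t_1, \dots, t_N)^{T}$ by minimizing a sum-of-squares error:

$$y(x, \textbf{w}) = \sum_{j=0}^{M} w_j x^j, \qquad E(\textbf{w}) = \frac{1}{2}\sum_{n=1}^{N} \bigl\{ y(x_n, \textbf{w}) - t_n \bigr\}^2$$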
Mathematical notation
Vectors are denoted by lower case bold Roman letters such as $\textbf{x}$, and all vectors are assumed to be column vectors. A superscript $T$ denotes the transpose of […]
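In other words, writing the components out:

$$\textbf{x} = (x_1, \dots, x_N)^{T} = \begin{pmatrix} x_1 \\ \vdots \\ x_N \end{pmatrix}$$

so $(x_1, \dots, x_N)$ is a row vector, and its transpose is the column vector the notation assumes.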
GBDT Core Source Code Walkthrough
[This post was published quite early; newer versions of sklearn have since rewritten this in Rust, so it can only be read for fun now.] sklearn's GBDT implementation follows the paper Greedy Function Approximation to the letter, so let's look at how it is done. The real core of the GBDT source code is its handling of the loss function: set the loss aside, and the rest of the code is intuitive, standard program logic. So we will start from sklearn's implementation of the loss.
Implementing the Loss Function
Take binary classification as an example, where the loss is the Binomial Deviance. The name may look unfamiliar, but it is the same thing as the negative log-likelihood / cross entropy we already know. Since this is binary classification, the model's final output is just $P(y=1|x)$, the probability that sample $x$ is positive. Writing this probability as $p(x)$, the Binomial Deviance equals $$\ell(y, F(x)) = -\left[ y\log(p(x)) + (1 - y)\log(1 - p(x)) \right […]
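A quick numerical sanity check of that equivalence (a NumPy sketch, not sklearn's actual code; the sigmoid link $p(x) = 1/(1+e^{-F(x)})$ from the raw score to the probability is the usual assumption):

```python
import numpy as np

# With p(x) = sigmoid(F(x)), the binomial deviance written in terms of
# the raw score F simplifies to log(1 + exp(F)) - y * F, which is
# exactly the cross entropy / negative log-likelihood.
def deviance_from_raw(y, f):
    return np.logaddexp(0.0, f) - y * f  # numerically stable form

def cross_entropy_from_prob(y, p):
    return -(y * np.log(p) + (1 - y) * np.log(1 - p))

y = np.array([1.0, 0.0, 1.0, 0.0])
f = np.array([2.0, -1.0, 0.5, -3.0])   # raw model scores F(x)
p = 1.0 / (1.0 + np.exp(-f))           # p(x) = sigmoid(F(x))
assert np.allclose(deviance_from_raw(y, f), cross_entropy_from_prob(y, p))
```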
How Multi-Head Attention Is Computed
An intuitive walkthrough of how Attention and Multi-Head Attention are computed, followed by a NumPy implementation.
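As a preview of where the post ends up, here is a minimal single-head scaled dot-product attention in NumPy (shapes and variable names are illustrative):

```python
import numpy as np

def softmax(x, axis=-1):
    # subtract the max for numerical stability
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def scaled_dot_product_attention(Q, K, V):
    # Q: (n_q, d_k), K: (n_k, d_k), V: (n_k, d_v)
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)     # similarity of queries to keys
    weights = softmax(scores, axis=-1)  # each row sums to 1
    return weights @ V                  # weighted sum of values

rng = np.random.default_rng(0)
Q = rng.normal(size=(4, 8))
K = rng.normal(size=(6, 8))
V = rng.normal(size=(6, 8))
print(scaled_dot_product_attention(Q, K, V).shape)  # (4, 8)
```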
Custom Objective Functions in XGBoost
xgboost ships with a rich enough set of built-in objective functions to cover everyday needs, but if you happen to have special requirements, it also lets you define a custom objective function (also called a loss function). This post shows how.
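A minimal sketch of the plumbing: a custom objective hands xgboost the first and second derivatives (gradient and hessian) of your loss with respect to the raw prediction. Here plain squared error is re-implemented on hypothetical toy data, just for illustration:

```python
import numpy as np
import xgboost as xgb

def squared_error_obj(preds, dtrain):
    labels = dtrain.get_label()
    grad = preds - labels       # dL/df for L = 0.5 * (f - y)^2
    hess = np.ones_like(preds)  # d^2L/df^2
    return grad, hess

# Hypothetical toy regression data, just to make the sketch runnable.
rng = np.random.default_rng(0)
X = rng.random((100, 5))
y = X @ np.array([1.0, -2.0, 0.5, 0.0, 3.0]) + 0.1 * rng.normal(size=100)

dtrain = xgb.DMatrix(X, label=y)
booster = xgb.train({"max_depth": 3}, dtrain, num_boost_round=20,
                    obj=squared_error_obj)
```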
This Is Probably the Original Idea Behind GBDT
GBDT stands for Gradient Boosting Decision Tree. The word "Gradient" naturally brings gradient descent to mind, so that is where we will start.
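As a preview of where that analogy lands (the standard function-space view, with $\rho$ a step size and $\ell$ the per-sample loss): gradient descent updates parameters, while gradient boosting applies the same update to the model's prediction $F(x)$, fitting each new tree to the negative gradient:

$$\theta_t = \theta_{t-1} - \rho\,\nabla_{\theta} L(\theta_{t-1}) \qquad\longleftrightarrow\qquad F_m(x) = F_{m-1}(x) - \rho_m \left[\frac{\partial \ell(y, F(x))}{\partial F(x)}\right]_{F(x) = F_{m-1}(x)}$$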