Rico's Nerd Cluster

"Before we leave this world, everything is a process"

Entropy Bottleneck

Entropy Encoding

Problem Setup Suppose one encoder output (latent vector) is z = [0.13, -1.82, 0.07]. Storing these as float32 costs $3 \times 32 = 96$ bits. Entropy coding requires discrete symbols, so we first...
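As a rough illustration of the bit accounting in the excerpt above, the sketch below compares the float32 cost with an ideal entropy-coded cost. It makes two assumptions that the excerpt does not state: the latent is discretized by rounding to the nearest integer, and symbol probabilities are taken from the sample itself, where a real codec would use a learned prior.

```python
import math
from collections import Counter

# Latent vector from the excerpt.
z = [0.13, -1.82, 0.07]

# Raw storage cost: 32 bits per float32 value.
float32_bits = len(z) * 32

# Assumption: discretize by rounding to the nearest integer.
symbols = [round(v) for v in z]

# Assumption: empirical symbol probabilities stand in for a learned prior.
counts = Counter(symbols)
probs = {s: c / len(symbols) for s, c in counts.items()}

# Ideal code length: sum of -log2 p(symbol) over the sequence.
ideal_bits = sum(-math.log2(probs[s]) for s in symbols)

print(f"float32: {float32_bits} bits")
print(f"symbols: {symbols}")
print(f"ideal entropy-coded length: {ideal_bits:.3f} bits")
```

The gap between the two numbers is the point of the exercise: once values are discrete symbols with a skewed distribution, frequent symbols can be assigned short codes.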

PyTorch Mixed Precision Training

torch.zeros, GradScaler, GradCheck

PyTorch Setup use_amp = True net = make_model(in_size, out_size, num_layers) opt = torch.optim.SGD(net.parameters(), lr=0.001) # if False, th...

Deep Learning - Mixed Floating Point Training

FP16, BF16, Mixed Precision Training

Refresher: Floating Point Calculation A floating-point number is represented as sign bit | exponent | mantissa. 0 | 10000001 | 10000000000000000000000 represents 6 because: Sign bit 0 represents posi...
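The worked example in the excerpt can be checked with a few lines of Python. This is a minimal sketch that assumes the IEEE 754 single-precision layout (1 sign bit, 8 exponent bits with bias 127, 23 mantissa bits) and handles only normal numbers; `decode_float32` is a hypothetical helper name, not something from the post.

```python
import struct

def decode_float32(bits: str) -> float:
    """Decode a 'sign | exponent | mantissa' bit string (normal numbers only)."""
    fields = bits.replace("|", "").replace(" ", "")
    if len(fields) != 32:
        raise ValueError("expected exactly 32 bits")
    sign = int(fields[0], 2)
    exponent = int(fields[1:9], 2)   # stored with a bias of 127
    mantissa = int(fields[9:], 2)    # fractional part, implicit leading 1
    return (-1) ** sign * (1 + mantissa / 2**23) * 2.0 ** (exponent - 127)

bits = "0 | 10000001 | 10000000000000000000000"
value = decode_float32(bits)

# Cross-check by reinterpreting the same 32 bits as a big-endian float32.
raw = int(bits.replace("|", "").replace(" ", ""), 2).to_bytes(4, "big")
check = struct.unpack(">f", raw)[0]

print(value, check)
```

Here the exponent field is 129, so the scale is 2^(129 - 127) = 4, and the mantissa contributes 1.5, giving 1.5 × 4 = 6.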

[ML] Libraries For Point Cloud Compression

einops

einops Einops is a lightweight Python library that makes tensor reshaping, permutation, tiling, and reduction readable and explicit. It works with PyTorch, NumPy, and TensorFlow. rearrange — r...

[ML] PyTorch Functions

autograd Function, Convolution, Normalization, Sum, Torch Cache Emptying

Torch.autograd.Function torch.autograd.Function lets you define a custom op with explicit forward() and backward() passes. Common built-ins like MaxPool are implemented this way. It’s especially u...

[ML] D-PCC Decoder Layers

icosahedron, losses

Decoder Overview The decoder reconstructs a dense point cloud from a compressed latent representation through multiple progressive upsampling stages rather than a single large expansion. Instea...

[ML] D-PCC Encoder-Layers

sub-pixel convolution

D-PCC paper Terminology latent feature: a compressed internal representation that contains essential information needed to reconstruct the original data. Cardinality: the number of elements ...

[ML] Point Cloud Transformer

Vector Attention

Terminology modulate: in ML, modulation means changing one signal using another signal. So the attention output $output=\alpha \odot v$, an elementwise product, is a modulation of the value features. Th...

[ML] OpenClaw

OpenClaw Video: https://www.youtube.com/watch?v=Otn-NbpQH1k Start Ollama: ollama run http://localhost:11434/ ollama launch openclaw # on the control panel /model qwen2.5:7b...

[CUDA - 6] CUDA Functions

atomicAdd, pragma unroll

atomicAdd There’s no such thing as “atomic memory” in CUDA: atomicity is a property of an operation, not a memory type. atomicAdd is an instruction you apply to a memory location (typically in gl...