Rico's Nerd Cluster

「离开世界之前 一切都是过程」

Deep Learning - Face Recognition Prelude

2D Frontalization, 3D Face Alignment

Introduction DeepFace introduced a 3D alignment step that projects 2D face images into a frontal view [1]. This is called “frontalization”. A very cool 2D->3D problem. A frontal view is a view ...

Deep Learning - Hands-On UNet Image Segmentation From Scratch

UNet

This article is inspired by this referece The source code of this project can be found here Data Loading When making a dataset, do NOT use jpeg and stick to png instead. Jpeg will compress the d...

Deep Learning - Hands-On YOLO V1 Transfer Learning

YOLO V1 Theory & Transfer Learning

You Only Look Once (YOLO) V1 The main idea is to divide an image into a 7x7 grid. Each grid will detect the existence of 2 bounding box whose center is within the grid cell and outputs [p_1, bx_1,...

Deep Learning - Performance Metrics

mean Average Precision (mAP), Precision, Recall, ROC Curve, F1 Score

Terminology Area Under Curve Area Under Curve = AUC. True Positives, False Positives, True Negatives, False Negatives Suppose a class has 10 students, 5 boys and 5 girls. You use a machine to f...

Deep Learning - Image Segmentation

Encoder-Decoder, Fully-Convolutional Networks (FCN), U-Net

Pre-Requitstes Encoder-Decoder framework Autoencoders (or encoders) and autodecoders (or decoders) were introduced in the late 1980. An autoencoder compresses input data into smaller dimensions, ...

Deep Learning - Object Detection Notes Part 2

R-CNN

Region Based CNN (R-CNN, Girshick et al. CVPR 2014) Zhihu Regional Proposal is the core of R-CNN. It first uses a segmentation algorithm to find regions with objects, then use these regions as “r...

Deep Learning - Object Detection Notes Part 1

Convolution Implementation of Sliding Window, OverFeat

Introduction Image classification is given an image, output a class lable of the image Image classification and Object localization is Object Detection. For example, if there are 4 classes, [Pede...

Deep Learning - Hands-On ResNet Transfer Learning For CIFAR-10 Dataset

Data Normalization, Conv Net Training

ResNet-50 Transfer Learning COMPLETE CODE can be found here Data Loading Please see this blogpost for data loading Model Definition PyTorch Built-In Model 1 2 3 4 model = models.resnet50(weig...

Deep Learning - CNN Applications

TensorFlow Keras Sequential and Functional Models

TF Keras Sequential API TF Keras Sequential API can be used to build simple feedforward, sequential models. 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 ...

Deep Learning - Classic CNN Models

LeNet-5, AlexNet, VGG-16, ResNet-50, One-by-One Convolution, Inception Network

LeNet-5 (65K params) The LeNet-5 architecture (LeCun, 1998) and is still very widely used. 5 is simply the version number. LeNet-5 has 7 layers. Input layer are 32x32 grayscale images. MNIST i...