Rico's Nerd Cluster

"Before leaving this world, everything is a process."

Deep Learning - Face Recognition

Siamese Network, Deep Face

Introduction: Face verification (easier) vs. face recognition (harder). Face verification takes an input image, a name, and an ID, then outputs whether the image corresponds to that ID. Face recogniti...

Deep Learning - Face Recognition Prelude

2D Frontalization, 3D Face Alignment

Introduction DeepFace introduced a 3D alignment step that projects 2D face images into a frontal view [1]. This is called “frontalization”. A very cool 2D->3D problem. A frontal view is a view ...

Deep Learning - Hands-On UNet Image Segmentation From Scratch


This article is inspired by this reference. The source code of this project can be found here. Data Loading: When making a dataset, do NOT use JPEG; stick to PNG instead. JPEG will compress the d...

Deep Learning - Hands-On YOLO V1 Transfer Learning

YOLO V1 Theory & Transfer Learning

You Only Look Once (YOLO) V1: The main idea is to divide an image into a 7x7 grid. Each grid cell detects the existence of 2 bounding boxes whose centers are within the cell and outputs [p_1, bx_1,...

Deep Learning - Performance Metrics

mean Average Precision (mAP), Precision, Recall, ROC Curve, F1 Score

Terminology Area Under Curve Area Under Curve = AUC. True Positives, False Positives, True Negatives, False Negatives Suppose a class has 10 students, 5 boys and 5 girls. You use a machine to f...

Deep Learning - Image Segmentation

Encoder-Decoder, Fully-Convolutional Networks (FCN), U-Net

Prerequisites: Encoder-Decoder framework. Autoencoders (or encoders) and autodecoders (or decoders) were introduced in the late 1980s. An autoencoder compresses input data into smaller dimensions, ...

Deep Learning - Object Detection Notes Part 2


Region-Based CNN (R-CNN, Girshick et al., CVPR 2014) Zhihu. Region proposal is the core of R-CNN. It first uses a segmentation algorithm to find regions with objects, then uses these regions as “r...

Deep Learning - Object Detection Notes Part 1

Convolution Implementation of Sliding Window, OverFeat

Introduction: Image classification means that, given an image, the model outputs a class label for the image. Image classification combined with object localization is object detection. For example, if there are 4 classes, [Pede...

Deep Learning - Hands-On ResNet Transfer Learning For CIFAR-10 Dataset

Data Normalization, Conv Net Training

ResNet-50 Transfer Learning: The complete code can be found here. Data Loading: Please see this blog post for data loading. Model Definition: PyTorch Built-In Model. model = models.resnet50(weig...

Deep Learning - CNN Applications

TensorFlow Keras Sequential and Functional Models

TF Keras Sequential API: The TF Keras Sequential API can be used to build simple feedforward, sequential models. ...