Rico's Nerd Cluster

"Before leaving this world, everything is a process."

Deep Learning - Bert

Introduction BERT (Bidirectional Encoder Representations from Transformers) is great for tasks like question answering, NER (Named Entity Recognition), sentence classification, etc. BERT is not a transla...
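For a quick taste of the encoder-only usage this post describes, here is a minimal sketch using the Hugging Face `transformers` library; the `bert-base-uncased` checkpoint and the example sentence are assumptions for illustration, not taken from the post itself.

```python
# Minimal sketch: loading a pretrained BERT checkpoint and letting its
# bidirectional context fill in a masked token. Checkpoint name and example
# sentence are illustrative assumptions.
from transformers import pipeline

fill_mask = pipeline("fill-mask", model="bert-base-uncased")
for candidate in fill_mask("The capital of France is [MASK]."):
    # Each candidate carries the predicted token and its probability.
    print(candidate["token_str"], candidate["score"])
```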

Deep Learning - Neural Machine Translation

Hands-On Attention Project

Introduction And Data Preparation The goal of the project is to experiment with date translation, i.e., turning human-readable dates ("25th of June, 2009") into machine-readable dates ("2009-06-25"). We need to truncate data...
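A minimal sketch of what such (human-readable, machine-readable) training pairs could look like, built with only the standard library; the exact formats and helper names are assumptions for illustration, not the post's actual dataset.

```python
# Minimal sketch: generating (human-readable, machine-readable) date pairs.
# The "%d %B %Y" format drops the ordinal suffix ("25th") for simplicity;
# formats and names here are illustrative assumptions.
import random
from datetime import date, timedelta

def random_date_pair():
    d = date(1990, 1, 1) + timedelta(days=random.randint(0, 15000))
    human = d.strftime("%d %B %Y")   # e.g. "25 June 2009"
    machine = d.isoformat()          # e.g. "2009-06-25"
    return human, machine

for human, machine in (random_date_pair() for _ in range(5)):
    print(f"{human!r:>20} -> {machine!r}")
```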

Deep Learning - Speech Recognition Hands On

GRU-Based Trigger Word Detection

Trigger Word Detection Goal: when we say the word "activate", we hear a chime. Data: the data was recorded at various venues such as libraries, cafes, restaurants, homes, and offices. It has a positive...
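One plausible GRU-based architecture for this kind of frame-level trigger detection, sketched in Keras; the input shape and layer sizes are illustrative assumptions, not necessarily what the post trains.

```python
# Sketch of a GRU-based trigger word detector.
# Input: a spectrogram of shape (time_steps, n_freq); output: a per-time-step
# probability that the trigger word just ended. Sizes are illustrative assumptions.
import tensorflow as tf
from tensorflow.keras import layers, models

def build_trigger_model(time_steps=5511, n_freq=101):
    inputs = layers.Input(shape=(time_steps, n_freq))
    # 1-D convolution downsamples the time axis and extracts local features.
    x = layers.Conv1D(196, kernel_size=15, strides=4, activation="relu")(inputs)
    x = layers.BatchNormalization()(x)
    x = layers.Dropout(0.2)(x)
    # Two GRU layers keep the full sequence so every time step gets a label.
    x = layers.GRU(128, return_sequences=True)(x)
    x = layers.Dropout(0.2)(x)
    x = layers.GRU(128, return_sequences=True)(x)
    x = layers.Dropout(0.2)(x)
    # Per-time-step sigmoid: "did the trigger word just finish here?"
    outputs = layers.TimeDistributed(layers.Dense(1, activation="sigmoid"))(x)
    return models.Model(inputs, outputs)

model = build_trigger_model()
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
model.summary()
```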

Deep Learning - Speech Recognition

Audio Signal Processing, Spectrogram

Overview In speech recognition, scientists initially thought that phonemes, the individual sounds in words (like the "g" and "v" in "give"), were the best way to represent spoken words. This was becaus...
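Since the excerpt is about representing audio as spectrograms rather than phonemes, here is a minimal SciPy sketch of computing one; the synthetic chirp signal and window sizes are assumptions for demonstration only.

```python
# Minimal sketch: turning a raw waveform into a spectrogram with SciPy.
# A real pipeline would load recorded audio instead of the synthetic chirp here.
import numpy as np
from scipy import signal

fs = 16000                                  # sample rate (Hz)
t = np.arange(0, 2.0, 1.0 / fs)             # 2 seconds of audio
waveform = signal.chirp(t, f0=200.0, f1=4000.0, t1=2.0)  # frequency sweep

# Short-time Fourier analysis: a (frequency bins x time frames) power matrix.
freqs, times, Sxx = signal.spectrogram(waveform, fs=fs, nperseg=400, noverlap=200)
print(Sxx.shape)  # (n_freq_bins, n_time_frames)
```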

Deep Learning - Transformer Series 5 - Transformer Hands On

Hands-On Transformer Training and Validation

Tasks and Data It’s common practice to pad input sequences to MAX_SENTENCE_LENGTH. Therefore, the input is always [batch_size, max_sentence_length], with NUM_KEYS = NUM_QUERIES = max_sentence_leng...
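A minimal PyTorch sketch of that padding step, where every batch element is padded to the fixed length and a padding mask records which positions hold real tokens; names like PAD_ID and the toy token ids are assumptions.

```python
# Minimal sketch: padding variable-length token id sequences to a fixed
# MAX_SENTENCE_LENGTH and building the matching padding mask.
# PAD_ID, MAX_SENTENCE_LENGTH, and the toy sequences are illustrative assumptions.
import torch

PAD_ID = 0
MAX_SENTENCE_LENGTH = 8

def pad_batch(sequences):
    batch = torch.full((len(sequences), MAX_SENTENCE_LENGTH), PAD_ID, dtype=torch.long)
    for i, seq in enumerate(sequences):
        length = min(len(seq), MAX_SENTENCE_LENGTH)
        batch[i, :length] = torch.tensor(seq[:length], dtype=torch.long)
    # True where the position holds a real token, False where it is padding.
    mask = batch != PAD_ID
    return batch, mask

tokens, mask = pad_batch([[5, 9, 2], [7, 3, 3, 8, 1]])
print(tokens.shape)  # torch.Size([2, 8]) -> [batch_size, max_sentence_length]
print(mask)
```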

Deep Learning - Transformer Series 4 - Transformer All Together

Encoder, Decoder

Overview We’ve seen that RNNs and CNNs have longer maximum path lengths. CNNs can have better computational complexity for long sequences, but overall, self-attention is the best for deep architect...
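For reference, the figures this comparison usually leans on, as tabulated in "Attention Is All You Need" (Vaswani et al., 2017), can be summarized as follows (n = sequence length, d = model width, k = kernel size):

```latex
% Per-layer cost, sequential operations, and maximum path length for each layer type.
\begin{tabular}{lccc}
Layer type      & Complexity per layer     & Sequential ops & Max path length \\
Self-attention  & $O(n^2 \cdot d)$         & $O(1)$         & $O(1)$          \\
Recurrent (RNN) & $O(n \cdot d^2)$         & $O(n)$         & $O(n)$          \\
Convolutional   & $O(k \cdot n \cdot d^2)$ & $O(1)$         & $O(\log_k n)$   \\
\end{tabular}
```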

Deep Learning - Transformer Series 3 - Multi-Head and Self Attention

Multi-Head Attention, Self Attention, Comparison of Self Attention Against CNN, RNN

Multi-Head Attention To learn a richer set of behaviors, we can run multiple attention heads jointly on the same set of queries, keys, and values. Specifically, we are able to capture variou...
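A minimal PyTorch sketch of running several heads jointly over one shared set of queries, keys, and values, using the built-in nn.MultiheadAttention; all the dimensions below are illustrative assumptions.

```python
# Minimal sketch: multi-head attention over a shared set of queries, keys, values.
# embed_dim, num_heads, and the sequence lengths are illustrative assumptions.
import torch
import torch.nn as nn

embed_dim, num_heads = 64, 8
attention = nn.MultiheadAttention(embed_dim, num_heads, batch_first=True)

batch_size, num_queries, num_keys = 2, 10, 10
queries = torch.randn(batch_size, num_queries, embed_dim)
keys = torch.randn(batch_size, num_keys, embed_dim)
values = torch.randn(batch_size, num_keys, embed_dim)

# Each of the 8 heads attends in its own learned subspace of size embed_dim / num_heads;
# the per-head outputs are concatenated and projected back to embed_dim.
output, weights = attention(queries, keys, values)
print(output.shape)   # torch.Size([2, 10, 64])
print(weights.shape)  # torch.Size([2, 10, 10]) -- averaged over heads by default
```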

Deep Learning - Transformer Series 2 Vanilla Attention Mechanism

Attention Intuition, Query-Key-Value, Bahdanau Attention, Scaled-Dot Attention

Attention Intuition Imagine we are sitting in a room. We have a red cup of coffee and a notebook in front of us. When we first sit down, the red cup stands out. So it attracts our attention “invo...

Deep Learning - Transformer Series 1 - Embedding Pre-Processing

Positional Encoding, Padding Mask, Look-ahead Mask, Tokenization

What is Positional Encoding In natural language processing, it’s common to have 1 sentence ("I love ice cream") -> tokens ("I", "love", "ice", "cream") -> embedding(100, 104, 203, 301) ->...
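A minimal NumPy sketch of the sinusoidal positional encoding that gets added to the token embeddings in that pipeline; max_len and d_model are illustrative assumptions.

```python
# Minimal sketch: sinusoidal positional encoding, one vector per token position.
# max_len and d_model are illustrative assumptions.
import numpy as np

def positional_encoding(max_len, d_model):
    positions = np.arange(max_len)[:, None]   # (max_len, 1)
    dims = np.arange(d_model)[None, :]        # (1, d_model)
    # Each dimension pair (2i, 2i+1) uses a sine/cosine of a different wavelength.
    angle_rates = 1.0 / np.power(10000.0, (2 * (dims // 2)) / d_model)
    angles = positions * angle_rates
    encoding = np.zeros((max_len, d_model))
    encoding[:, 0::2] = np.sin(angles[:, 0::2])
    encoding[:, 1::2] = np.cos(angles[:, 1::2])
    return encoding

pe = positional_encoding(max_len=50, d_model=16)
print(pe.shape)  # (50, 16) -- added elementwise to the token embeddings
```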