Leave HJ’s trace, All-In
Multi-head Attention of transformer, GPT, BERT, DistilBERT, and PaLM
January 9, 2025
Gamma distribution and Poisson process
January 8, 2025
Bidirectional RNN, Beam Search, Attention Mechanism, Transformer
Beta distribution
January 7, 2025
Sentiment Analysis, Encoder-decoder translation