Notes for NLP project Questions and Solutions 1. Llama2 Model takes more than 40G GPU RAM to train. Solution: LoRA: Low-Rank Adaptation of Large Language Models Concepts: Instead o
Notebook archive
Loose pages, technical margins, and small notes gathered into a readable stack.
Notes for NLP project Questions and Solutions 1. Llama2 Model takes more than 40G GPU RAM to train. Solution: LoRA: Low-Rank Adaptation of Large Language Models Concepts: Instead o
Record problems in the "BERT" paper. Process Analysis 1. masked language model: randomly mask some of the input tokens, and predict the originally vocab id of the mask ones. 2. nex
Recording a task to solve a movie comment sentiment analysis task Data Preprocessing Dataset downloading Download movie comments dataset (IMDb) from Hugging Face Data Cleaning Part
Steps to learn and master a kind of model 2W and 3H What is the model for? Like RNN Language Model is used for word prediction or words generation. How is the model built? Maybe so
Some types of plots and how can they be helpful for machine learning tasks, such as feature selection. Data exploration During data exploration, we can use plots below to find more
Sequence Models and RNN Why use sequence models? Sequence models are used to situations when you have sequential input (e.g. images fo a human action, a paragraph of text or empty)