LLM

  1. What are the similarities and differences between neural networks and the human brain?
  2. Text is converted into token and positional embeddings which get updated during training. Therefore, the representation of the input itself changes over the course of training. Why do embeddings work?
  3. Different layers learn different features — earlier (or lower) layers learn fundamentals etc. Which layer learns what and why?

Leave a comment