unreasonably effective

you can be sloppy, as long as you are rigorous

LLM

What are the similarities and differences between neural networks and the human brain?
Text is split into tokens, and each token is mapped to a token embedding plus a positional embedding. In models where these embeddings are learned parameters, they get updated during training, so the representation of the input itself changes over the course of training. Why do embeddings work?
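To make the point about changing input representations concrete, here is a minimal sketch in plain Python. It is not how any particular library does it: the toy vocabulary, the sizes, the helper names (`init_table`, `embed`, `sgd_step`) and the stand-in gradient are all assumptions made for illustration. It assumes both embedding tables are learned and that the model input is their sum.

```python
import random

random.seed(0)

VOCAB = {"the": 0, "cat": 1, "sat": 2}  # hypothetical toy vocabulary
DIM = 4      # embedding width (assumption)
MAX_LEN = 8  # maximum sequence length (assumption)


def init_table(rows, cols):
    """Create a table of small random values, one row per token or position."""
    return [[random.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


token_emb = init_table(len(VOCAB), DIM)  # one learned row per vocabulary entry
pos_emb = init_table(MAX_LEN, DIM)       # one learned row per position


def embed(tokens):
    """Return one vector per token: token embedding + positional embedding."""
    ids = [VOCAB[t] for t in tokens]
    return [
        [te + pe for te, pe in zip(token_emb[tok_id], pos_emb[pos])]
        for pos, tok_id in enumerate(ids)
    ]


def sgd_step(tokens, grads, lr=0.1):
    """Apply one gradient-descent update to the rows that were looked up.

    grads[pos] is the gradient of some loss with respect to the summed input
    vector at that position. Because the input is a sum, the same gradient
    flows to both the token row and the position row.
    """
    for pos, tok in enumerate(tokens):
        tok_id = VOCAB[tok]
        for d in range(DIM):
            token_emb[tok_id][d] -= lr * grads[pos][d]
            pos_emb[pos][d] -= lr * grads[pos][d]


sentence = ["the", "cat", "sat"]
before = embed(sentence)

# Stand-in gradient: a real one would come from backpropagation through the model.
fake_grads = [[1.0] * DIM for _ in sentence]
sgd_step(sentence, fake_grads)

after = embed(sentence)
print("before:", before[0])
print("after: ", after[0])
```

The same sentence maps to different vectors before and after the update, which is the sense in which the input representation itself is trained. Only the rows that were actually looked up move, so tokens and positions that never appear in the data keep their initial random values.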
Different layers learn different features: earlier (lower) layers tend to pick up the more basic patterns, and later layers build on them. Which layer learns what, and why?

tanvirdotzaman

February 8, 2025
