Looking inside LLMs

Published: October 7, 2024

Meta launched its Llama 3.1 model not long ago, claiming it to be the largest open-source LLM to date, and it arrives amid a flurry of open-source LLMs hitting the market. Companies are spending thousands on training these models and millions on the infrastructure needed to train and deploy them.

From a personal perspective, I am fascinated by the architecture of these vast networks: the layers, the weight distributions, and more. In my previous work [1], we used the activations of different layers to prune a neural network, eliminating redundant neurons and thereby shrinking the model for edge deployment. A key insight from that study was that the internal representations of these networks, often sparse despite the billions of parameters behind them, contain valuable information that can further our understanding of the models. Interestingly, a significant fraction of these parameters are zero or near-zero, underscoring the inherent sparsity of these networks. Yet this has not deterred corporate investment in discovering these “giganormous” sparse matrices. Over time, I expect we will learn more efficient methods to uncover such structures, which may also shed light on LLM explainability, as several studies have attempted by analyzing network activations.
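To make the sparsity observation concrete, here is a minimal sketch of how one might measure the fraction of near-zero parameters in an open checkpoint. The model name (gpt2) and the 1e-3 threshold are placeholders of my choosing, not values from the study above; any model loadable through Hugging Face transformers would work the same way.

```python
# Minimal sketch: estimate the fraction of near-zero parameters in a model.
# "gpt2" and the 1e-3 threshold are illustrative choices, not from the study.
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("gpt2")

threshold = 1e-3
total, near_zero = 0, 0
for param in model.parameters():
    w = param.detach().abs()
    total += w.numel()                         # count every parameter
    near_zero += (w < threshold).sum().item()  # count the near-zero ones

print(f"{near_zero / total:.1%} of parameters have |w| < {threshold}")
```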

As an illustrative exercise, I have plotted Kernel Density Estimates (KDEs), a smooth alternative to traditional histograms for visualizing distributions (for newcomers to KDE, see here [2]). These plots reveal various patterns in the parameters of several open-source LLMs. I encourage you to explore the graphs and share your observations in the comments below.
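If you want to produce similar plots yourself, the sketch below shows one way to do it: load a small open model through Hugging Face transformers and draw KDEs of a few attention weight matrices with seaborn. This is not the exact script behind the figures that follow; the model name (gpt2), the layer filter, and the sample size are all illustrative.

```python
# Minimal sketch: KDEs of selected weight matrices from a small open model.
# Model name, layer filter, and sample size are illustrative choices.
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
from transformers import AutoModelForCausalLM

model_name = "gpt2"  # stand-in for a larger open-source LLM
model = AutoModelForCausalLM.from_pretrained(model_name)

rng = np.random.default_rng(0)
plt.figure(figsize=(8, 5))
for name, param in model.named_parameters():
    # Keep only the attention projection weights of the first three blocks.
    if "attn.c_attn.weight" in name and any(f".h.{i}." in name for i in range(3)):
        weights = param.detach().float().flatten().numpy()
        # Subsample so the KDE stays cheap to compute.
        sample = rng.choice(weights, size=min(50_000, weights.size), replace=False)
        sns.kdeplot(sample, label=name, fill=False)

plt.xlabel("parameter value")
plt.ylabel("density")
plt.title(f"KDE of attention weights ({model_name})")
plt.legend(fontsize=7)
plt.tight_layout()
plt.show()
```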

In the coming days, I plan to dig deeper into these open-source LLMs and learn more about their underlying parameters. Stay tuned for more insights.

Kernel Density Estimates for LLM parameters