Article

Obsidian Source: Notes / Neel Nanda - MATS Stream

Summary

Pending synthesis from local Obsidian source.

Original source title: Neel Nanda Mats Stream

Extracted Preview

Every second counts, and I have a pretty good idea.

I want to discover circuits where hinglish occurs. I'll find sentences where they differ only by one word, and try to find which one attends to highlish tokens. Then, just to put the final nail in the coffin, I'll use a variety of sentences.

Dataset : https://huggingface.co/datasets/DanArnin/Hinglish2

The GOD Paper : https://arxiv.org/pdf/2211.00593

Look at a lot of attention heads, and try finding difference.

"She likes to dance on bollywood songs" :
13684, 18861, 316, 18487, 402, 7271, 18034, 15374

"She likes to dance on bollywood gaane" :
13684, 18861, 316, 18487, 402, 7271, 18034, 6666, 1986

Where does the Hinglish Circuit lies? How do we design the experiment?

Integration Notes

  • Source folder: /home/yashs/Documents/Docs/Obsidian/Research-Notes
  • Local source: /home/yashs/Documents/Docs/Obsidian/Research-Notes/Notes/Neel Nanda - MATS Stream.md
  • Raw copy: raw/obsidian/research-notes/Notes/Neel Nanda - MATS Stream.md

Links Created Or Updated

Open Questions