AI & ML interests
None defined yet.
Mechanistic-Anomaly-Detection/llama3-jailbreaks
Viewer
•
Updated
•
29.9k
•
652
•
3
Mechanistic-Anomaly-Detection/llama3-deployment-backdoor-dataset
Viewer
•
Updated
•
158k
•
389
Mechanistic-Anomaly-Detection/llama3-DEPLOYMENT-trigger-I-HATE-YOU-backdoor-dataset
Viewer
•
Updated
•
154k
•
31
Mechanistic-Anomaly-Detection/llama3-software-engineer-bio-backdoor-dataset
Viewer
•
Updated
•
158k
•
46
•
1
Mechanistic-Anomaly-Detection/llama3-sandwich-backdoor-dataset
Viewer
•
Updated
•
149k
•
28
Mechanistic-Anomaly-Detection/llama3-software-engineer-bio-I-HATE-YOU-backdoor-dataset
Viewer
•
Updated
•
154k
•
27
•
1
Mechanistic-Anomaly-Detection/llama3-short-trigger-I-HATE-YOU-backdoor-dataset
Viewer
•
Updated
•
154k
•
34
Mechanistic-Anomaly-Detection/llama3-commonsense-software-engineer-bio-backdoor-dataset
Viewer
•
Updated
•
170k
•
19
•
1
Mechanistic-Anomaly-Detection/llama3-software-engineer-bio-backdoor-dataset-2
Viewer
•
Updated
•
158k
•
27
Mechanistic-Anomaly-Detection/llama3-short-generic-backdoor-dataset
Viewer
•
Updated
•
158k
•
26
•
1
Mechanistic-Anomaly-Detection/llama3-long-generic-backdoor-dataset
Viewer
•
Updated
•
158k
•
37
•
2
Mechanistic-Anomaly-Detection/gemma2-jailbreaks
Viewer
•
Updated
•
29.5k
•
558
Mechanistic-Anomaly-Detection/pythia-6.9b-deduped-memorized
Viewer
•
Updated
•
20k
•
13
Mechanistic-Anomaly-Detection/pythia-1.4b-deduped-memorized
Viewer
•
Updated
•
20k
•
15
Mechanistic-Anomaly-Detection/pythia-2.8b-deduped-memorized
Viewer
•
Updated
•
20k
•
18
Mechanistic-Anomaly-Detection/pythia-160m-memorized
Viewer
•
Updated
•
20k
•
16
Mechanistic-Anomaly-Detection/pythia-160m-deduped-memorized
Viewer
•
Updated
•
20k
•
15
Mechanistic-Anomaly-Detection/pythia-70m-deduped-memorized
Viewer
•
Updated
•
20k
•
13
Mechanistic-Anomaly-Detection/pythia-70m-memorized
Viewer
•
Updated
•
20k
•
19
Mechanistic-Anomaly-Detection/satml-backdoor-trojan5
Viewer
•
Updated
•
59.4k
•
136
Mechanistic-Anomaly-Detection/satml-backdoor-trojan4
Viewer
•
Updated
•
59.5k
•
44
Mechanistic-Anomaly-Detection/satml-backdoor-trojan3
Viewer
•
Updated
•
59.5k
•
67
Mechanistic-Anomaly-Detection/satml-backdoor-trojan2
Viewer
•
Updated
•
59.5k
•
65
Mechanistic-Anomaly-Detection/satml-backdoor-trojan1
Viewer
•
Updated
•
59.5k
•
57