AI & ML interests
None yet
Organizations
None yet
yanhong-li/llama3_3b_gdn_v4_hybrid_0_125_S1_LL35_selection
Updated
yanhong-li/llama3_3b_gdn_v4_hybrid_0_125_S1_LL1_selection
Updated
yanhong-li/qwen2_3b_instruct_gla_v1_hybrid_0_5_ppl_selection
Updated
yanhong-li/llama3_3b_gdn_v4_hybrid_0_125_mse_selection
Updated
yanhong-li/qwen2_3b_instruct_gla_v1_hybrid_0_33_uniform
Updated
yanhong-li/qwen2_3b_instruct_gdn_v4_hybrid_30attn_trained_s2_gdnv4_num_layer_35_selection
Updated
yanhong-li/llama3_3b_gdn_v4_hybrid_22attn_trained_s2_gdnv4_num_layer_35_selection
Updated
yanhong-li/llama3_3b_gdn_v4_hybrid_0_5_kv_selection
Updated
yanhong-li/llama3_3b_gdn_v4_hybrid_0_125_S1_LL1_35_selection
Updated
yanhong-li/qwen2_3b_instruct_gla_v1_hybrid_0_25_ar_mutihop_selection
Updated
yanhong-li/llama3_3b_gla_v1_hybrid_0_5_ppl_selection
Updated
yanhong-li/llama3_3b_gdn_v4_hybrid_23attn_trained_s2_gdnv4_num_layer_35_selection
Updated
yanhong-li/llama3_3b_gla_v1_hybrid_0_5_ar_selection
Updated
yanhong-li/qwen2_3b_gdn_v4_hybrid_0_33_S1_LL1_35_selection
Updated