realign/train/configs/fsdp/qwen_fsdp_config.json
2024-03-09 10:55:34 +08:00

4 lines
90 B
JSON

{
"fsdp_transformer_layer_cls_to_wrap": ["QWenBlock"],
"limit_all_gathers": true
}