realign/train/configs/fsdp/internlm_fsdp_config.json
2024-03-09 10:55:34 +08:00

4 lines
101 B
JSON

{
"fsdp_transformer_layer_cls_to_wrap": ["InternLMDecoderLayer"],
"limit_all_gathers": true
}