realign/train/configs/fsdp/llama2_fsdp_config.json
2024-03-09 10:55:34 +08:00


{
  "fsdp_transformer_layer_cls_to_wrap": ["LlamaDecoderLayer"],
  "limit_all_gathers": true
}
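
A minimal sketch of how a config like this might be parsed and sanity-checked before training starts. The helper name `load_fsdp_config` and the inlined `CONFIG_TEXT` are hypothetical and not part of the repository; only the two keys and their values come from the file above.

```python
import json

# Contents of the config file, inlined so the sketch is self-contained.
CONFIG_TEXT = """
{
  "fsdp_transformer_layer_cls_to_wrap": ["LlamaDecoderLayer"],
  "limit_all_gathers": true
}
"""


def load_fsdp_config(text: str) -> dict:
    """Parse the JSON text and check the types of its two fields.

    Hypothetical helper for illustration; the repository may load the
    file differently.
    """
    config = json.loads(text)

    # Names of the transformer block classes that FSDP should wrap as units.
    layers = config["fsdp_transformer_layer_cls_to_wrap"]
    if not (isinstance(layers, list) and all(isinstance(n, str) for n in layers)):
        raise TypeError("fsdp_transformer_layer_cls_to_wrap must be a list of strings")

    # Boolean flag that is forwarded to FSDP unchanged.
    if not isinstance(config["limit_all_gathers"], bool):
        raise TypeError("limit_all_gathers must be a boolean")

    return config


fsdp_config = load_fsdp_config(CONFIG_TEXT)
print(fsdp_config)
```

If the training script is built on Hugging Face Transformers (an assumption, since the file itself does not say), a dict or file path of this shape is typically passed as the `fsdp_config` argument of `TrainingArguments`, together with an `fsdp` mode such as `"full_shard auto_wrap"`.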