MossForCausalLM
- class MossForCausalLM(config)[source]
- base_model_prefix = 'model'
- static load_parallel_state_dict(path, config, process_exclusion=False, protocol='file', **kwargs)[source]
Load the state_dict from path. The pretrained model must be stored in the same format as Hugging Face checkpoints.
- Returns:
state_dict. Note that the state_dict should be processed properly to match the current rank.
- training: bool
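
The note on load_parallel_state_dict says the state_dict has to match the current rank. As a rough illustration of what that can mean under tensor parallelism, the sketch below splits each weight row-wise so that a rank keeps only its own slice. It is plain Python and is not this library's implementation: the helper names shard_rows and shard_state_dict, the parameter name model.linear.weight, and the even row-wise split are all assumptions made for the example.

```python
def shard_rows(weight, rank, world_size):
    """Return the contiguous block of rows assigned to `rank` (hypothetical helper)."""
    rows = len(weight)
    if rows % world_size != 0:
        raise ValueError("row count must be divisible by world_size")
    per_rank = rows // world_size
    start = rank * per_rank
    return weight[start:start + per_rank]


def shard_state_dict(state_dict, rank, world_size):
    """Apply the same row-wise split to every entry (assumed sharding rule)."""
    return {
        name: shard_rows(weight, rank, world_size)
        for name, weight in state_dict.items()
    }


# A toy "full" checkpoint with one 4x2 weight, stored as nested lists.
# The key name is made up for this example.
full = {"model.linear.weight": [[1, 2], [3, 4], [5, 6], [7, 8]]}

# With two ranks, each rank keeps two of the four rows.
local_rank0 = shard_state_dict(full, rank=0, world_size=2)
local_rank1 = shard_state_dict(full, rank=1, world_size=2)
```

In a real model the split axis usually differs per layer (some weights are split by row, others by column, and some are replicated), so an even row-wise split is only the simplest case.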