 The contents of `src/MaxText` have moved to `src/maxtext` as part of a larger
 [restructuring effort in MaxText](https://github.com/AI-Hypercomputer/maxtext/blob/2790ed289c0c4cb704645d5d2ab91da26711b891/RESTRUCTURE.md).
-This directory only contains shim files to temporarily support legacy commands like `python3 -m MaxText.train ...`.
-These legacy commands are now deprecated and will be removed soon. Please migrate your existing commands and avoid using
-legacy ones. The new command locations can be found at:
+
+The legacy `MaxText.*` shim files for the core commands listed below have been removed.
+Please migrate your existing commands to the new locations:
 
 * `MaxText.decode` → `maxtext.inference.decode`
-* `MaxText.distillation.train_distill` → `maxtext.trainers.post_train.distillation.train_distill`
 * `MaxText.maxengine_server` → `maxtext.inference.maxengine.maxengine_server`
-* `MaxText.rl.evaluate_rl` → `maxtext.trainers.post_train.rl.evaluate_rl`
-* `MaxText.rl.train_rl` → `maxtext.trainers.post_train.rl.train_rl`
-* `MaxText.sft.sft_trainer` → `maxtext.trainers.post_train.sft.train_sft`
 * `MaxText.train` → `maxtext.trainers.pre_train.train`
 * `MaxText.train_compile` → `maxtext.trainers.pre_train.train_compile`
 * `MaxText.train_tokenizer` → `maxtext.trainers.tokenizer.train_tokenizer`
+
+The following shims are still temporarily available in their respective subdirectories but are deprecated and will be removed soon:
+
+* `MaxText.distillation.train_distill` → `maxtext.trainers.post_train.distillation.train_distill`
+* `MaxText.rl.evaluate_rl` → `maxtext.trainers.post_train.rl.evaluate_rl`
+* `MaxText.rl.train_rl` → `maxtext.trainers.post_train.rl.train_rl`
+* `MaxText.sft.sft_trainer` → `maxtext.trainers.post_train.sft.train_sft`
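
The migration list above can be applied mechanically to existing launch commands. The following is a minimal sketch, not part of this change: the mapping is copied from the README lists, while `LEGACY_TO_NEW` and `migrate_command` are hypothetical names introduced here for illustration. It matches whole dotted module names, so `MaxText.train` is not confused with `MaxText.train_compile` or `MaxText.train_tokenizer`.

```python
import re

# Legacy -> new module paths, copied from the README migration lists.
LEGACY_TO_NEW = {
    "MaxText.decode": "maxtext.inference.decode",
    "MaxText.maxengine_server": "maxtext.inference.maxengine.maxengine_server",
    "MaxText.train": "maxtext.trainers.pre_train.train",
    "MaxText.train_compile": "maxtext.trainers.pre_train.train_compile",
    "MaxText.train_tokenizer": "maxtext.trainers.tokenizer.train_tokenizer",
    "MaxText.distillation.train_distill": "maxtext.trainers.post_train.distillation.train_distill",
    "MaxText.rl.evaluate_rl": "maxtext.trainers.post_train.rl.evaluate_rl",
    "MaxText.rl.train_rl": "maxtext.trainers.post_train.rl.train_rl",
    "MaxText.sft.sft_trainer": "maxtext.trainers.post_train.sft.train_sft",
}

# A full dotted module name starting with "MaxText.", e.g. "MaxText.rl.train_rl".
_LEGACY_MODULE = re.compile(r"\bMaxText(?:\.\w+)+")


def migrate_command(command: str) -> str:
    """Hypothetical helper: swap known legacy module names for their new paths."""

    def _swap(match: re.Match) -> str:
        legacy = match.group(0)
        # Unknown module names are left untouched rather than guessed at.
        return LEGACY_TO_NEW.get(legacy, legacy)

    return _LEGACY_MODULE.sub(_swap, command)
```

For example, `migrate_command("python3 -m MaxText.train_compile base.yml")` rewrites only the module name and leaves the remaining arguments as they were.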
Five files were deleted in this change (diffs not loaded).
 """ Train tokenizer
 Example usage (parquet):
-  python3 -m MaxText.train_tokenizer \
+  python3 -m maxtext.trainers.tokenizer.train_tokenizer \
     --grain_train_files=gs://my-bucket/data/*.parquet \
     --grain_file_type=parquet
 
 Example usage (arrayrecord):
-  python3 -m MaxText.train_tokenizer \
+  python3 -m maxtext.trainers.tokenizer.train_tokenizer \
     --grain_train_files=gs://my-bucket/data/*.arrayrecord \
     --grain_file_type=arrayrecord \
     --data_column=text
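
When the tokenizer trainer is launched from Python rather than a shell, the same invocation can be assembled as an argument list. This is a minimal sketch under stated assumptions: the module path and the three flags come from the docstring above, while `build_train_tokenizer_argv` is a hypothetical helper and no other flags are assumed to exist.

```python
from typing import List, Optional

# New module path, taken from the updated docstring.
TRAIN_TOKENIZER_MODULE = "maxtext.trainers.tokenizer.train_tokenizer"


def build_train_tokenizer_argv(
    train_files: str,
    file_type: str,
    data_column: Optional[str] = None,
) -> List[str]:
    """Hypothetical helper: build the argv for the tokenizer trainer."""
    argv = [
        "python3",
        "-m",
        TRAIN_TOKENIZER_MODULE,
        f"--grain_train_files={train_files}",
        f"--grain_file_type={file_type}",
    ]
    # --data_column appears only in the arrayrecord example, so it is optional here.
    if data_column is not None:
        argv.append(f"--data_column={data_column}")
    return argv
```

Passing the resulting list to `subprocess.run(argv, check=True)` runs the command without a shell, so a pattern such as `gs://my-bucket/data/*.parquet` reaches the program unexpanded.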
@@ -43,7 +43,7 @@ export OUTFILE="${OUTDIR}/${script_name}.txt"
 mkdir -p $OUTDIR
 echo
 # Run script
-${cmd} python3 -m MaxText.${script_name} \
+${cmd} python3 -m maxtext.inference.${script_name} \
   "${MAXTEXT_CONFIGS_DIR:-${MAXTEXT_REPO_ROOT:-$PWD}/src/maxtext/configs}"/base.yml \
   tokenizer_path=${TOKENIZER_PATH} \
   load_parameters_path=${LOAD_PARAMETERS_PATH} \
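
The config path in the command above relies on nested shell defaults: `MAXTEXT_CONFIGS_DIR` wins if set and non-empty, otherwise the path falls back to `src/maxtext/configs` under `MAXTEXT_REPO_ROOT`, and finally under the current directory. The same fallback order can be sketched in Python. `resolve_base_config` is a hypothetical name, and using `os.getcwd()` in place of `$PWD` is an approximation.

```python
import os
from typing import Mapping, Optional


def resolve_base_config(
    env: Optional[Mapping[str, str]] = None,
    cwd: Optional[str] = None,
) -> str:
    """Hypothetical helper mirroring the nested ${VAR:-default} expansion."""
    env = os.environ if env is None else env
    # Stand-in for $PWD; the two can differ when symlinks are involved.
    cwd = os.getcwd() if cwd is None else cwd

    # ":-" falls back when the variable is unset OR empty, hence "or".
    repo_root = env.get("MAXTEXT_REPO_ROOT") or cwd
    configs_dir = env.get("MAXTEXT_CONFIGS_DIR") or f"{repo_root}/src/maxtext/configs"

    # Plain string concatenation, as in the shell command.
    return f"{configs_dir}/base.yml"
```

With neither variable set and a working directory of `/work`, this yields `/work/src/maxtext/configs/base.yml`.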