mwptoolkit.utils.preprocess_tool.dataset_operator

mwptoolkit.utils.preprocess_tool.dataset_operator.ept_preprocess(datas, dataset_name)[source]
mwptoolkit.utils.preprocess_tool.dataset_operator.id_reedit(trainset, validset, testset)[source]

if some datas of a dataset hava the same id, re-edit the id for differentiate them.

example: There are two datas have the same id 709356. Make one of them be 709356 and the other be 709356-1.

mwptoolkit.utils.preprocess_tool.dataset_operator.preprocess_ept_dataset_(train_datas, valid_datas, test_datas, dataset_name)[source]
mwptoolkit.utils.preprocess_tool.dataset_operator.refine_formula_as_prefix(item, numbers, dataset_name)[source]