Skip to content

xlm.tasks.sudoku_extreme

Preprocessing for brozonoyer/sapientinc-sudoku-extreme-timvink-sudoku-solver.

Dataset has "question" (puzzle, "." for blanks) and "answer" (solution). We convert "." -> "0" to match the tokenizer convention (vocab 0-9) and produce input_token_ids / prompt_token_ids like the standard sudoku task.

sudoku_extreme_preprocess_fn(example, tokenizer)

Preprocess sapientinc-sudoku-extreme examples.

Uses "question" (puzzle) and "answer" (solution). Blanks are "." in the dataset; we convert to "0" before tokenizing.

Also processes "trajectory" field which contains a list of strings representing step-by-step board configurations from question to solution.