We include an inefficient reference PyTorch implementation in gpt_oss/torch/model.py. This code uses basic PyTorch operators to show the exact model architecture, with one small addition: tensor parallelism support in the MoE layers, so that the larger model can run with this code.
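
As a rough illustration of why tensor parallelism fits an MoE expert, here is a minimal sketch. It is not the code in gpt_oss/torch/model.py. It uses plain Python lists instead of PyTorch to stay dependency-free, and it makes several assumptions: a ReLU stand-in for the real activation, a single expert with no routing, and hypothetical helper names (`expert_mlp`, `shard_expert`, `tensor_parallel_expert`). Each simulated rank holds a slice of the expert's hidden dimension, and summing the per-rank partial outputs (the role an all-reduce would play across devices) reproduces the unsharded result.

```python
import math
import random


def matvec(w, x):
    """Multiply matrix w (a list of rows) by vector x."""
    return [sum(wi * xi for wi, xi in zip(row, x)) for row in w]


def relu(v):
    """Elementwise ReLU; a stand-in for the real activation (assumption)."""
    return [max(0.0, a) for a in v]


def expert_mlp(w_up, w_down, x):
    """Unsharded expert: w_down @ relu(w_up @ x).

    w_up has shape (hidden, d_model); w_down has shape (d_model, hidden).
    """
    return matvec(w_down, relu(matvec(w_up, x)))


def shard_expert(w_up, w_down, world_size):
    """Split the hidden dimension: rows of w_up, matching columns of w_down."""
    hidden = len(w_up)
    assert hidden % world_size == 0, "hidden size must divide evenly across ranks"
    step = hidden // world_size
    shards = []
    for rank in range(world_size):
        lo, hi = rank * step, (rank + 1) * step
        shards.append((w_up[lo:hi], [row[lo:hi] for row in w_down]))
    return shards


def tensor_parallel_expert(shards, x):
    """Run each shard independently, then sum the partial outputs.

    The final sum plays the role of an all-reduce across devices.
    """
    partials = [expert_mlp(up, down, x) for up, down in shards]
    return [sum(vals) for vals in zip(*partials)]


# Hypothetical toy sizes, chosen only for illustration.
random.seed(0)
d_model, hidden, world_size = 4, 8, 2
w_up = [[random.uniform(-1, 1) for _ in range(d_model)] for _ in range(hidden)]
w_down = [[random.uniform(-1, 1) for _ in range(hidden)] for _ in range(d_model)]
x = [random.uniform(-1, 1) for _ in range(d_model)]

full = expert_mlp(w_up, w_down, x)
sharded = tensor_parallel_expert(shard_expert(w_up, w_down, world_size), x)
```

The split works because the activation is elementwise over the hidden dimension, so each rank can compute its slice without seeing the others; only the final sum needs communication. The two results agree up to floating-point rounding, since the additions happen in a different order.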