QuantizedRowParallelLinear

class paddle.nn.quant.quant_layers.QuantizedRowParallelLinear(layer: Layer, weight_bits: int = 8, activation_bits: int = 8, moving_rate: float = 0.9, weight_quantize_type: _QuantType = 'abs_max', activation_quantize_type: _QuantType = 'abs_max', weight_pre_layer: Layer | None = None, act_pre_layer: Layer | None = None, weight_quant_layer: Literal[None] = None, act_quant_layer: Literal[None] = None)
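
The defaults weight_bits=8 and weight_quantize_type='abs_max' suggest a symmetric quantizer scaled by the largest absolute value in the tensor. The following is a minimal pure-Python sketch of that idea, not Paddle's implementation: the helper name fake_quant_abs_max is hypothetical, a single per-tensor scale is assumed, and Python's round (round-half-to-even) may differ from the rounding used by the real kernel.

```python
def fake_quant_abs_max(values, bits=8):
    """Quantize then dequantize a flat list of floats with one abs-max scale.

    Hypothetical helper for illustration only; it is not part of Paddle.
    """
    bound = (1 << (bits - 1)) - 1            # 127 for 8-bit signed values
    scale = max(abs(v) for v in values)      # per-tensor abs-max scale
    if scale == 0.0:
        return list(values)                  # all zeros: nothing to quantize
    # Map to the integer grid [-bound, bound], then back to floats.
    return [round(v / scale * bound) * scale / bound for v in values]


weights = [0.5, -1.0, 0.25, 0.003]
dequantized = fake_quant_abs_max(weights, bits=8)
```

Under these assumptions, each dequantized value differs from the original by at most half a quantization step, that is, scale / (2 * bound).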
forward(input: Tensor) → Tensor

Defines the computation performed at every call. Should be overridden by all subclasses.

Parameters
  • input (Tensor) – the input tensor.
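
As the class name indicates, this layer wraps a row-parallel linear layer. The sketch below simulates, in a single process and without any Paddle call, the computation such a layer is generally understood to perform: the weight is split along its input (row) dimension, each rank multiplies its slice of the input by its weight shard, and the partial results are summed (the role an all-reduce plays in a real distributed run). The names matmul and row_parallel_forward are hypothetical, quantization and bias are omitted, and the input dimension is assumed to divide evenly by the number of ranks.

```python
def matmul(x, w):
    """Multiply x of shape [batch][k] by w of shape [k][out]."""
    return [[sum(row[k] * w[k][j] for k in range(len(w)))
             for j in range(len(w[0]))]
            for row in x]


def row_parallel_forward(x, w, world_size):
    """Simulate a row-parallel linear forward pass in one process.

    Hypothetical illustration; assumes len(w) is divisible by world_size.
    """
    shard = len(w) // world_size
    total = None
    for rank in range(world_size):
        lo, hi = rank * shard, (rank + 1) * shard
        x_part = [row[lo:hi] for row in x]   # this rank's slice of the input
        w_part = w[lo:hi]                    # this rank's rows of the weight
        partial = matmul(x_part, w_part)
        if total is None:
            total = partial
        else:
            # Element-wise sum stands in for the all-reduce across ranks.
            total = [[a + b for a, b in zip(ra, rb)]
                     for ra, rb in zip(total, partial)]
    return total


x = [[1.0, 2.0, 3.0, 4.0]]
w = [[1.0, 0.0], [0.0, 1.0], [2.0, 0.0], [0.0, 2.0]]
out = row_parallel_forward(x, w, world_size=2)
```

In this sketch the sharded result equals the unsharded product matmul(x, w), which is the property that makes the row-wise split valid.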