Instructions for using microsoft/cvt-13-384 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
  - Transformers
How to use microsoft/cvt-13-384 with Transformers:
```python
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("image-classification", model="microsoft/cvt-13-384")
pipe("https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/hub/parrots.png")
```

```python
# Load model directly
from transformers import AutoImageProcessor, AutoModelForImageClassification

processor = AutoImageProcessor.from_pretrained("microsoft/cvt-13-384")
model = AutoModelForImageClassification.from_pretrained("microsoft/cvt-13-384")
```

- Notebooks
- Google Colab
- Kaggle
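The "load model directly" snippet above stops at instantiating the model; to actually classify an image, the processor's tensors are passed through the model and the largest logit is mapped back to a label. The final argmax step can be sketched without downloading the model itself; the logits and label map below are made-up illustrative values, not real CvT-13 outputs.

```python
# With the model loaded, inference would look roughly like:
#   inputs = processor(images=image, return_tensors="pt")
#   logits = model(**inputs).logits
#   label = model.config.id2label[logits.argmax(-1).item()]
# The helper below sketches that last reduction step in pure Python.

def classify(logits, id2label):
    """Return the label whose logit is largest (argmax + lookup)."""
    best_id = max(range(len(logits)), key=lambda i: logits[i])
    return id2label[best_id]

# Toy values standing in for model outputs and config.id2label.
toy_logits = [0.1, 2.7, -1.3]
toy_id2label = {0: "cat", 1: "macaw", 2: "dog"}
print(classify(toy_logits, toy_id2label))  # → macaw
```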
Add TF weights
#1
by CCMat - opened
Model converted by the transformers `pt_to_tf` CLI. All converted model outputs and hidden layers were validated against their PyTorch counterparts.
Maximum crossload output difference=1.135e-04; Maximum crossload hidden layer difference=2.536e-02;
Maximum conversion output difference=1.135e-04; Maximum conversion hidden layer difference=2.536e-02;
CAUTION: The maximum admissible error was manually increased to 0.03!
@joaogante , @nielsr , @sgugger
The maximum admissible error was increased because batch normalization introduces small numerical differences that get amplified through the forward pass.
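The validation described above boils down to an elementwise comparison: the converted TF model's outputs and hidden states must not differ from the PyTorch ones by more than the admissible error. A minimal sketch of that check, using toy stand-in values rather than real model activations (the 0.03 tolerance is the one reported in this PR):

```python
# Sketch of a pt_to_tf-style tolerance check: compare flattened
# PyTorch and TensorFlow activations and fail if the maximum
# absolute difference exceeds the admissible error.

def max_abs_diff(a, b):
    """Maximum elementwise absolute difference between two sequences."""
    return max(abs(x - y) for x, y in zip(a, b))

ADMISSIBLE_ERROR = 0.03  # manually raised for this model (default is stricter)

# Toy stand-ins for flattened PT/TF hidden states.
pt_hidden = [0.500, -1.200, 3.100]
tf_hidden = [0.501, -1.190, 3.125]

diff = max_abs_diff(pt_hidden, tf_hidden)
assert diff <= ADMISSIBLE_ERROR, f"difference {diff} exceeds tolerance"
print(f"max difference: {diff:.3e}")  # → max difference: 2.500e-02
```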
The corresponding GitHub PR: https://github.com/huggingface/transformers/pull/18597
joaogante changed pull request status to merged