method: pix2struct-base2022-09-18
Authors: Google AI Language
Affiliation: Google
Description: Pix2Struct is a simple pixel-level model, pretrained on the raw signal from Web screenshots, that transfers to a wide variety of visually-situated language understanding tasks.