Authors: Qwen Team

Affiliation: Alibaba Group

Description: QwenVL
1. A single model, no ensemble.
2. End-to-end model, no OCR pipeline.
3. Generalist model, no specialist finetuning.
Try the model at https://tongyi.aliyun.com/qianwen and the API at https://help.aliyun.com/zh/dashscope/developer-reference/vl-plus-quick-start/
Follow us at https://github.com/QwenLM/Qwen-VL

Method: RALLM

Date: 2023-07-05

Authors: xyy

Description: RALLM

Ranking Table

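Column groups: Answer type (Image span, Question span, Multiple spans, Non span); Evidence (Table/List, Textual, Visual object, Figure, Map); Operation (Comparison, Arithmetic, Counting).

| Date | Method | Score | Image span | Question span | Multiple spans | Non span | Table/List | Textual | Visual object | Figure | Map | Comparison | Arithmetic | Counting |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2022-03-02 | Human Performance | 0.9718 | 0.9745 | 0.9777 | 0.9335 | 0.9716 | 0.9780 | 0.9789 | 0.9770 | 0.9699 | 0.9433 | 0.9712 | 0.9837 | 0.9544 |
| 2024-01-24 | qwenvl-max (single generalist model) | 0.7341 | 0.7756 | 0.8083 | 0.6035 | 0.5717 | 0.7291 | 0.8856 | 0.6708 | 0.6892 | 0.5967 | 0.6009 | 0.7152 | 0.4388 |
| 2023-07-05 | RALLM | 0.7175 | 0.7421 | 0.7884 | 0.0830 | 0.8031 | 0.6866 | 0.7088 | 0.7376 | 0.7214 | 0.8049 | 0.7141 | 0.8038 | 0.7916 |
| 2023-11-15 | SMoLA-PaLI-X Specialist Model | 0.6621 | 0.7166 | 0.7252 | 0.5838 | 0.4292 | 0.6448 | 0.8261 | 0.6714 | 0.6110 | 0.5065 | 0.5238 | 0.5054 | 0.3506 |
| 2024-02-10 | ScreenAI 5B | 0.6590 | 0.7162 | 0.7247 | 0.5735 | 0.4140 | 0.6525 | 0.8315 | 0.5968 | 0.6021 | 0.4467 | 0.4815 | 0.5303 | 0.3000 |
| 2023-12-07 | SMoLA-PaLI-X Generalist Model | 0.6556 | 0.7107 | 0.7228 | 0.5642 | 0.4197 | 0.6200 | 0.8237 | 0.6710 | 0.6095 | 0.5246 | 0.5159 | 0.4988 | 0.3372 |
| 2021-04-11 | Applica.ai TILT | 0.6120 | 0.6765 | 0.6419 | 0.4391 | 0.3832 | 0.5917 | 0.7916 | 0.4545 | 0.5654 | 0.4480 | 0.4801 | 0.4958 | 0.2652 |
| 2023-08-20 | PaLI-X (Google Research, Single Generative Model) | 0.5477 | 0.5940 | 0.6950 | 0.4122 | 0.3534 | 0.5145 | 0.6891 | 0.6373 | 0.5040 | 0.4013 | 0.4290 | 0.4053 | 0.3091 |
| 2023-10-09 | nnrc_udop_224 | 0.4299 | 0.4716 | 0.5279 | 0.2410 | 0.2785 | 0.3740 | 0.5755 | 0.3475 | 0.3944 | 0.3347 | 0.2997 | 0.3583 | 0.1866 |
| 2022-09-18 | pix2struct-large | 0.4001 | 0.4308 | 0.4839 | 0.2059 | 0.3173 | 0.3833 | 0.5256 | 0.2572 | 0.3726 | 0.3283 | 0.2762 | 0.4198 | 0.2017 |
| 2021-04-09 | IG-BERT (single model) | 0.3854 | 0.4181 | 0.4481 | 0.2197 | 0.2849 | 0.3373 | 0.5016 | 0.3013 | 0.3706 | 0.3347 | 0.2939 | 0.3564 | 0.2000 |
| 2022-09-18 | pix2struct-base | 0.3820 | 0.4145 | 0.4381 | 0.1655 | 0.3014 | 0.3351 | 0.4971 | 0.2380 | 0.3632 | 0.3257 | 0.2344 | 0.4036 | 0.1888 |
| 2021-04-11 | NAVER CLOVA | 0.3219 | 0.3997 | 0.2317 | 0.1064 | 0.1068 | 0.2653 | 0.4488 | 0.1878 | 0.3095 | 0.3231 | 0.2020 | 0.1480 | 0.0695 |
| 2021-04-10 | Ensemble LM and VLM | 0.2853 | 0.3337 | 0.4181 | 0.0748 | 0.1169 | 0.2439 | 0.3649 | 0.2331 | 0.2645 | 0.2845 | 0.2580 | 0.1628 | 0.0647 |
| 2021-11-09 | LayoutLMv2 LARGE | 0.2829 | 0.3430 | 0.2763 | 0.0641 | 0.1114 | 0.2449 | 0.3855 | 0.1440 | 0.2601 | 0.3110 | 0.1897 | 0.1130 | 0.1158 |
| 2022-09-20 | BROS_BASE (WebViCoB 1M) | 0.2809 | 0.3436 | 0.2485 | 0.0277 | 0.1303 | 0.2545 | 0.3620 | 0.1318 | 0.2767 | 0.2886 | 0.2207 | 0.1745 | 0.0854 |
| 2022-03-03 | InfographicVQA paper model | 0.2720 | 0.3278 | 0.2386 | 0.0450 | 0.1371 | 0.2400 | 0.3626 | 0.1705 | 0.2551 | 0.2205 | 0.1836 | 0.1559 | 0.1140 |
| 2021-04-05 | BERT fuzzy search | 0.2078 | 0.2625 | 0.2333 | 0.0739 | 0.0259 | 0.1852 | 0.2995 | 0.0896 | 0.1942 | 0.1709 | 0.1805 | 0.0160 | 0.0436 |
| 2021-04-10 | BERT | 0.1678 | 0.2149 | 0.2117 | 0.0126 | 0.0152 | 0.1479 | 0.2450 | 0.1054 | 0.1505 | 0.1768 | 0.1578 | 0.0158 | 0.0185 |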
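
The overall Score column does not always agree with the per-category columns, so it can help to re-rank the methods by a single column. The following is a minimal sketch of that, not part of the leaderboard itself: it copies three rows from the ranking table by hand, and the names `COLUMNS`, `ROWS`, and `rank_by` are hypothetical helpers introduced only for illustration.

```python
# Column order as it appears in the ranking table: the overall Score,
# then the answer-type, evidence, and operation breakdowns.
COLUMNS = [
    "Score",
    "Image span", "Question span", "Multiple spans", "Non span",
    "Table/List", "Textual", "Visual object", "Figure", "Map",
    "Comparison", "Arithmetic", "Counting",
]

# Three rows copied by hand from the ranking table (illustrative subset only).
ROWS = {
    "qwenvl-max (single generalist model)": [
        0.7341, 0.7756, 0.8083, 0.6035, 0.5717, 0.7291, 0.8856,
        0.6708, 0.6892, 0.5967, 0.6009, 0.7152, 0.4388,
    ],
    "RALLM": [
        0.7175, 0.7421, 0.7884, 0.0830, 0.8031, 0.6866, 0.7088,
        0.7376, 0.7214, 0.8049, 0.7141, 0.8038, 0.7916,
    ],
    "SMoLA-PaLI-X Specialist Model": [
        0.6621, 0.7166, 0.7252, 0.5838, 0.4292, 0.6448, 0.8261,
        0.6714, 0.6110, 0.5065, 0.5238, 0.5054, 0.3506,
    ],
}


def rank_by(column):
    """Return the method names sorted from best to worst on one column."""
    idx = COLUMNS.index(column)
    return sorted(ROWS, key=lambda name: ROWS[name][idx], reverse=True)


if __name__ == "__main__":
    for col in ("Score", "Multiple spans", "Counting"):
        print(col, "->", rank_by(col)[0])
```

Among these three rows, qwenvl-max leads on Score and Multiple spans, while RALLM leads on Counting.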

Ranking Graphic