Results - ICDAR2017 Robust Reading Challenge on COCO-Text

method: Sogou_MM2018-09-16

Authors: Hang Yang,Bin Li

Description: Global Concatenating Feature Enhancement for Text Detection

method: PATech-AILab2018-08-27

Authors: Yayun Ji, Xinghua Zhu, Zhenhou Hong, Jianzong Wang, Jing Xiao.

Description: Instance segmentation based, use multiple data augmentation methods.

method: Foo & Bar2017-07-01

Authors: Zheqi He, Yongtao Wang

Description: An improvement of Faster RCNN to meet the requirement of detecting quadrilateral object like text. The bounding box regression layer is replaced with a quadrangle regression layer, and the regression target and the loss function are modified accordingly. ResNet-152 is used as base net. To incorporate ResNet-152 and Faster R-CNN, conv4 x and conv5 x are disconnected from ResNet-152, the downsampling of conv4 x is removed and the region
proposal network (RPN) and RoIPooling are inserted between them. The network first processes the whole image to produce a convolutional feature map. This map is used as the input of RPN to generate regions of interest (RoIs), each with an objectness score. These RoIs and the feature map generated by conv4 x are fed to the RoiPooling layer in order to get the fixed-size feature map. This feature map is fed to several convolutional layers
(conv5 x). Conv5 x and layers after it play the roles of fully connected layers commonly seen in VGG networks, they calculate the feature map of each RoI and these feature map is pooled by a global average pooling (GAP). Finally, the output of GAP is fed into two sibling output layers: a classification layer to get the label of each ROI, and a quadrangle
regression layer that outputs 8 real-valued numbers for each RoI, each set of 8 values encodes the coordinates of the vertices of the text region. The method is implemented under TensorFlow. The detection network is pre-trained on imagenet, no any other additional data was used.

Ranking Table

Description Paper Source Code

Date	Method	Average Precision	Hmean	Recall	Precision
2018-09-16	Sogou_MM	69.15%	3.33%	95.41%	1.69%
2018-08-27	PATech-AILab	68.61%	2.81%	93.34%	1.43%
2017-07-01	Foo & Bar	67.16%	5.95%	83.66%	3.08%
2017-06-30	SRC-B-MachineLearningLab	66.30%	3.08%	83.30%	1.57%
2017-06-29	CCFLAB	64.67%	3.19%	82.15%	1.63%
2017-11-04	SSD	62.07%	3.73%	82.26%	1.91%
2017-06-29	Tencent-DPPR Team & USTB-PRIR	61.95%	40.30%	74.98%	27.56%
2019-12-04	Test-Msk_v2	59.23%	63.82%	68.71%	59.59%
2017-10-10	TextNetwork	52.59%	53.89%	68.57%	44.39%
2017-06-30	UM	51.01%	55.11%	65.47%	47.58%
2017-06-30	HappyCCL	49.57%	53.04%	64.82%	44.88%
2017-06-29	Text_Detection_DL	48.90%	61.35%	61.81%	60.90%
2017-06-30	SCUT-DLVClab	48.76%	42.04%	62.58%	31.65%
2017-06-28	SARI_FDU_RRPN	46.16%	43.65%	63.23%	33.33%
2017-07-01	SCUT-DLVClab-HuangGroup	41.79%	22.89%	57.53%	14.29%
2017-06-30	BRTRS-Detection	41.21%	49.69%	54.56%	45.62%
2017-10-26	EPTN-SJTU	33.75%	58.08%	54.31%	62.42%
2017-10-26	FTDN-SJTU	32.04%	56.39%	55.23%	57.61%
2017-06-29	TextFCN	29.57%	26.83%	43.78%	19.35%
2017-10-11	TextFCN V2	28.34%	30.11%	43.31%	23.08%
2017-10-06	SSD (Sravya)	6.26%	19.98%	13.44%	38.96%
2017-06-22	CNN-LSTM based text detection	6.19%	15.92%	20.52%	13.00%
2017-06-29	RFCN	3.41%	10.77%	7.56%	18.68%
2017-06-30	Cas-hoteye	0.26%	3.40%	4.96%	2.58%

Inactive evaluations

method: Sogou_MM2018-09-16

method: PATech-AILab2018-08-27

method: Foo & Bar2017-07-01

Ranking Table

Ranking Graphic

Ranking Graphic