P. Arbelaez, J. Pont-tuset, J. Barron, F. Marques, and J. Malik, Multiscale Combinatorial Grouping, 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014.
DOI : 10.1109/CVPR.2014.49

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.433.2307

H. Bristow, J. Valmadre, and S. Lucey, Dense Semantic Correspondence Where Every Pixel is a Classifier, 2015 IEEE International Conference on Computer Vision (ICCV)
DOI : 10.1109/ICCV.2015.458

URL : http://arxiv.org/abs/1505.04143

X. Chen, R. Mottaghi, X. Liu, S. Fidler, and R. Urtasun, Detect What You Can: Detecting and Representing Objects Using Holistic Models and Body Parts, 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014.
DOI : 10.1109/CVPR.2014.254

URL : http://arxiv.org/abs/1406.2031

M. Cho, S. Kwak, C. Schmid, and J. Ponce, Unsupervised object discovery and localization in the wild: Part-based matching with bottom-up region proposals, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), p.3, 2015.
DOI : 10.1109/CVPR.2015.7298724

URL : https://hal.archives-ouvertes.fr/hal-01110036

C. Choy, J. Gwak, S. Savarese, and M. Chandraker, Universal correspondence network, Proc. Neural Info. Proc. Systems, 2008.

N. Dalal and B. Triggs, Histograms of Oriented Gradients for Human Detection, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), 2005.
DOI : 10.1109/CVPR.2005.177

URL : https://hal.archives-ouvertes.fr/inria-00548512

M. Everingham, L. Van-gool, C. Williams, J. Winn, and A. Zisserman, The Pascal Visual Object Classes (VOC) Challenge, International Journal of Computer Vision, vol.73, issue.2, 2007.
DOI : 10.1371/journal.pcbi.0040027

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.167.6629

L. Fei-fei, R. Fergus, and P. Perona, One-shot learning of object categories, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.28, issue.4, pp.594-611, 2006.
DOI : 10.1109/TPAMI.2006.79

R. Girshick, Fast R-CNN, 2015 IEEE International Conference on Computer Vision (ICCV), p.5
DOI : 10.1109/ICCV.2015.169

B. Ham, M. Cho, C. Schmid, and J. Ponce, Proposal Flow, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2008.
DOI : 10.1109/CVPR.2016.378

URL : https://hal.archives-ouvertes.fr/hal-01240281

X. Han, T. Leung, Y. Jia, R. Sukthankar, and A. C. Berg, MatchNet: Unifying feature and metric learning for patchbased matching, Proc. IEEE Conf. Comp. Vision Patt. Recog, 2008.

B. Hariharan, J. Malik, and D. Ramanan, Discriminative Decorrelation for Clustering and Classification, Proc. European Conf. Comp. Vision, pp.459-472, 2012.
DOI : 10.1007/978-3-642-33765-9_33

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.259.8110

K. He, X. Zhang, S. Ren, and J. Sun, Spatial pyramid pooling in deep convolutional networks for visual recognition, Proc. European Conf. Comp. Vision, 2014.
DOI : 10.1007/978-3-319-10578-9_23

URL : http://arxiv.org/abs/1406.4729

B. K. Horn and B. G. Schunck, ???Determining optical flow???: a retrospective, Artificial Intelligence, vol.59, issue.1-2, pp.81-87, 1993.
DOI : 10.1016/0004-3702(93)90173-9

URL : https://deepblue.lib.umich.edu/bitstream/2027.42/30981/1/0000654.pdf

J. Hosang, R. Benenson, P. Dollár, and B. Schiele, What Makes for Effective Detection Proposals?, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.38, issue.4, 2006.
DOI : 10.1109/TPAMI.2015.2465908

URL : http://arxiv.org/pdf/1502.05082

J. Hur, H. Lim, C. Park, and S. C. Ahn, Generalized Deformable Spatial Pyramid: Geometry-preserving dense correspondence estimation, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015.
DOI : 10.1109/CVPR.2015.7298745

A. Kanazawa, D. W. Jacobs, and M. Chandraker, WarpNet: Weakly Supervised Matching for Single-View Reconstruction, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
DOI : 10.1109/CVPR.2016.354

URL : http://arxiv.org/abs/1604.05592

I. Kemelmacher-shlizerman and S. M. Seitz, Collection flow, 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012.
DOI : 10.1109/CVPR.2012.6247876

J. Kim, C. Liu, F. Sha, and K. Grauman, Deformable Spatial Pyramid Matching for Fast Dense Correspondences, 2013 IEEE Conference on Computer Vision and Pattern Recognition
DOI : 10.1109/CVPR.2013.299

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.362.8285

S. Kim, D. Min, B. Ham, S. Jeon, S. Lin et al., Fcss: Fully convolutional self-similarity for dense semantic correspondence, Proc. IEEE Conf. Comp. Vision Patt. Recog., 2017. 1

S. W. Kim, D. Min, B. Ham, and K. Sohn, Dasc: Dense adaptative self-correlation descriptor for multi-modal and multispectral correspondence, Proc. IEEE Conf. Comp. Vision Patt. Recog, 2015.

A. Krizhevsky, I. Sutskever, and G. E. Hinton, ImageNet classification with deep convolutional neural networks, Proc. Neural Info. Proc. Systems, 2012.
DOI : 10.1162/neco.2009.10-08-881

C. Liu, J. Yuen, and A. Torralba, Nonparametric Scene Parsing via Label Transfer, IEEE Trans. Patt. Anal. Mach. Intell, vol.33, issue.12 8, pp.2368-2382, 2011.
DOI : 10.1007/978-3-319-23048-1_10

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.221.555

C. Liu, J. Yuen, and A. Torralba, SIFT Flow: Dense Correspondence Across Scenes and Its Applications, IEEE Trans. Patt. Anal. Mach. Intell, vol.33, issue.5, pp.978-994, 2008.
DOI : 10.1007/978-3-319-23048-1_2

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.175.1740

J. L. Long, N. Zhang, and T. Darrell, Do convnets learn correspondence?, Proc. Neural Info. Proc. Systems, 2014.

D. G. Lowe, Distinctive Image Features from Scale-Invariant Keypoints, International Journal of Computer Vision, vol.60, issue.2, pp.91-110, 2004.
DOI : 10.1023/B:VISI.0000029664.99615.94

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.14.4931

S. Manen, M. Guillaumin, and L. Van-gool, Prime Object Proposals with Randomized Prim's Algorithm, 2013 IEEE International Conference on Computer Vision, 2007.
DOI : 10.1109/ICCV.2013.315

J. Matas, O. Chum, M. Urban, and T. Pajdla, Robust wide-baseline stereo from maximally stable extremal regions, Image and Vision Computing, vol.22, issue.10, pp.761-767, 2004.
DOI : 10.1016/j.imavis.2004.02.006

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.671.8241

M. Okutomi and T. Kanade, A multiple-baseline stereo, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.15, issue.4, pp.353-363, 1993.
DOI : 10.1109/34.206955

URL : http://repository.cmu.edu/cgi/viewcontent.cgi?article=3012&context=compsci

Y. Peng, A. Ganesh, J. Wright, W. Xu, and Y. Ma, RASL: Robust alignment by sparse and low-rank decomposition for linearly correlated images, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.2233-2246, 2012.
DOI : 10.1109/CVPR.2010.5540138

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.171.472

J. Revaud, P. Weinzaepfel, Z. Harchaoui, and C. Schmid, Deepmatching: Hierarchical deformable dense matching. ArXiv e-prints, 2015.
DOI : 10.1007/s11263-016-0908-3

URL : https://hal.archives-ouvertes.fr/hal-01148432

C. Rhemann, A. Hosni, M. Bleyer, C. Rother, and M. Gelautz, Fast cost-volume filtering for visual correspondence and beyond, CVPR 2011, 2011.
DOI : 10.1109/CVPR.2011.5995372

E. Simo-serra, E. Trulls, L. Ferraz, I. Kokkinos, P. Fua et al., Discriminative Learning of Deep Convolutional Feature Point Descriptors, 2015 IEEE International Conference on Computer Vision (ICCV), p.8, 2015.
DOI : 10.1109/ICCV.2015.22

K. Simonyan and A. Zisserman, Very deep convolutional networks for large-scale visual recognition, Proc. IEEE Conf. Comp. Vision Patt. Recog, p.8, 2014.

T. Taniai, S. N. Sinha, and Y. Sato, Joint Recovery of Dense Correspondence and Cosegmentation in Two Images, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
DOI : 10.1109/CVPR.2016.460

M. Tau and T. Hassner, Dense Correspondences across Scenes and Scales, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.38, issue.5, 2015.
DOI : 10.1109/TPAMI.2015.2474356

URL : http://arxiv.org/abs/1406.6323

E. Tola, V. Lepetit, and P. Fua, DAISY: An Efficient Dense Descriptor Applied to Wide-Baseline Stereo, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.32, issue.5, pp.815-830, 2010.
DOI : 10.1109/TPAMI.2009.77

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.168.5084

J. R. Uijlings, K. E. Van-de-sande, T. Gevers, and A. W. Smeulders, Selective Search for Object Recognition, International Journal of Computer Vision, vol.57, issue.1, pp.154-171, 2013.
DOI : 10.1023/B:VISI.0000013087.49260.fb

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.361.3382

J. R. Uijlings, K. E. Van-de-sande, T. Gevers, and A. W. Smeulders, Selective Search for Object Recognition, International Journal of Computer Vision, vol.57, issue.1, pp.154-171, 2013.
DOI : 10.1023/B:VISI.0000013087.49260.fb

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.361.3382

P. Weinzaepfel, J. Revaud, Z. Harchaoui, and C. Schmid, DeepFlow: Large Displacement Optical Flow with Deep Matching, 2013 IEEE International Conference on Computer Vision, 2013.
DOI : 10.1109/ICCV.2013.175

URL : https://hal.archives-ouvertes.fr/hal-00873592

H. Yang, W. Lin, and J. Lu, DAISY Filter Flow: A Generalized Discrete Approach to Dense Correspondences, 2014 IEEE Conference on Computer Vision and Pattern Recognition
DOI : 10.1109/CVPR.2014.435

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.673.6401

Y. Yang and D. Ramanan, Articulated Human Detection with Flexible Mixtures of Parts, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.35, issue.12, pp.2878-2890, 2013.
DOI : 10.1109/TPAMI.2012.261

K. M. Yi, E. Trulls, V. Lepetit, and P. Fua, LIFT: Learned Invariant Feature Transform, Proc. European Conf. Comp. Vision, 2016.
DOI : 10.1109/TPAMI.2009.167

URL : http://arxiv.org/abs/1603.09114

S. Zagoruyko and N. Komodakis, Learning to compare image patches via convolutional neural networks, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2008.
DOI : 10.1109/CVPR.2015.7299064

URL : https://hal.archives-ouvertes.fr/hal-01246261

J. Zbontar and Y. Lecun, Computing the stereo matching cost with a convolutional neural network, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015.
DOI : 10.1109/CVPR.2015.7298767

J. Zbontar and Y. Lecun, Stereo matching by training a convolutional neural network to compare image patches, Journal of Machine Learning Research, vol.17, issue.1 2, pp.1-32, 2016.
DOI : 10.1109/cvpr.2015.7298767

URL : http://arxiv.org/abs/1409.4326

T. Zhou, Y. J. Lee, S. X. Yu, and A. A. Efros, FlowWeb: Joint image set alignment by weaving consistent, pixel-wise correspondences, Proc. IEEE Conf. Comp. Vision Patt. Recog, 2008.

T. Zhou, P. Krähenbühl, M. Aubry, Q. Huang, and A. A. Efros, Learning Dense Correspondence via 3D-Guided Cycle Consistency, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016.
DOI : 10.1109/CVPR.2016.20

URL : http://arxiv.org/abs/1604.05383

C. L. Zitnick and P. Dollár, Edge Boxes: Locating Object Proposals from Edges, Proc. European Conf. Comp. Vision, 2014.
DOI : 10.1007/978-3-319-10602-1_26

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.453.5208