Jay Taylor's notes

back to listing index


[web search]
Original source (github.com)
Tags: deep-learning academic-paper github.com
Clipped on: 2016-11-09

Skip to content
Open this file in GitHub Desktop
100 lines (80 sloc) 8.16 KB


Papers about deep learning ordered by task, date. Current state-of-the-art papers are labelled.

Object Recognition

  • Learning to Make Better Mistakes: Semantics-aware Visual Food Recognition, okt 2016, IBM, paper
  • T-CNN: Tubelets with Convolutional Neural Networks for Object Detection from Videos, aug 2016, github, arxiv
  • Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning, aug 2016, Google, arxiv
  • Residual Networks of Residual Networks: Multilevel Residual Networks, aug 2016, arxiv
  • Training Region-based Object Detectors with Online Hard Example Mining, apr 2016, Facebook, arxiv
  • Deep Residual Learning for Image Recognition, dec 2015, arxiv
  • SSD: Single Shot MultiBox Detector, dec 2015, Google, github, arxiv Image (Asset 3/3) alt=
  • ParseNet: Looking Wider to See Better, jun 2015, arxiv
  • You Only Look Once: Unified, Real-Time Object Detection, jun 2015, Facebook, arxiv
  • Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks, jun 2015, Microsoft/Facebook arxiv
  • Selective Search for Object Recognition, 2012, paper
  • Rich feature hierarchies for accurate object detection and semantic segmentation, 2014, paper

Pose Estimation

  • Fast Single Shot Detection and Pose Estimation, sep 2016, arxiv

Face Recognition

  • Accessorize to a Crime: Real and Stealthy Attacks on State-of-the-Art Face Recognition, paper
  • OpenFace: A general-purpose face recognition library with mobile applications, June 2016, paper
  • Deep Face Recognition, 2015, paper
  • Compact Convolutional Neural Network Cascade for Face Detection, aug 2015, arxiv
  • Learning Robust Deep Face Representation, Jul 2015, arxiv
  • FaceNet: A Unified Embedding for Face Recognition and Clustering, jun 2015, paper
  • Multi-view Face Detection Using Deep Convolutional Neural Networks, yahoo, feb 2015, arxiv

Style Transfer

Logo Recognition

  • Automatic Graphic Logo Detection via Fast Region-based Convolutional Networks, apr 2016, arxiv
  • Logo Localization and Recognition in Natural Images Using Homographic Class Graphs, 2016, paper
  • LOGO-Net: Large-scale Deep Logo Detection and Brand Recognition with Deep Region-based Convolutional Networks, nov 2015, arxiv
  • DeepLogo: Hitting Logo Recognition with the Deep Neural Network Hammer, okt 2015, Berkely, arxiv
  • Automatic detection of logos in video and their removal using inpainting, jul 2015, paper
  • On the Benefit of Synthetic Data for Company Logo Detection, 2015, paper
  • Fast and Robust Realtime Storefront Logo Recognition, paper
  • Scalable Logo Recognition in Real-World Images, 2011, paper
  • https://arxiv.org/pdf/1609.01414v1.pdf

note: also includes some papers that use SIFT

Text (in the Wild) Recognition

  • COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural Images, jun 2016, arxiv
  • Recursive Recurrent Nets with Attention Modeling for OCR in the Wild, mar 2016, arxiv
  • Efficient Scene Text Localization and Recognition with Local Character Refinement, apr 2015, arxiv
  • Reading Text in the Wild with Convolutional Neural Networks, dec 2014, arxiv
  • Synthetic Data and Artificial Neural Networks for Natural Scene Text Recognition, jun 2014, arxiv
  • Text Detection and Character Recognition in Scene Images with Unsupervised Feature Learning, 2011, paper

ConvNet visualization

  • Visualizing and Understanding Convolutional Networks, Nov 2013, arxiv

Image Segmentation

  • SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation, dec 2015, arxiv

Pedestrian Detection

  • Joint Deep Learning for Pedestrian Detection, 2013, paper

Super Resolution

  • Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network, sep 2016, Twitter, arxiv

Automated Theorem Proving

  • DeepMath - Deep Sequence Models for Premise Selection, jun 2016, Google arxiv

Reverse Engineering

  • Semi-supervised Knowledge Transfer for Deep Learning from Private Training Data, okt 2016, arxiv
  • Stealing Machine Learning Models via Prediction APIs, aug 2016, paper



Architecture and optimization

  • SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size, feb 2016, arxiv

Tools for Deep Learning

Tools for Papers

Data Sets

  • MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition, jul 2016, arxiv
  • Family in the Wild (FIW): A Large-scale Kinship Recognition Database, apr 2016, arxiv
  • https://github.com/openimages/dataset
  • YouTube-8M: A Large-Scale Video Classification Benchmark, sep 2016, Google, arxiv