What are the most recent developments in state-of-the-art machine learning (ML) systems, and deep learning models, for computer vision detection and classification? Specifically the architectures.

Part

of one

Part

What are the most recent developments in state-of-the-art machine learning (ML) systems, and deep learning models, for computer vision detection and classification? Specifically the architectures.

Hello, and thanks for your question asking what are the most recent developments in state-of-the-art machine learning (ML) systems, and deep learning models, for computer vision detection and classification.  I have organized 36 links to research into categories.  Below you will find a deep dive into my research, along with all the details as to how I came to this conclusion.

OVERVIEW
In order to answer your question I reviewed the information that you were already familiar with, and have then provided links to the research mentioned in each of the articles that you mentioned, and more.  I have arranged these into categories, and have summarized which category appears most in this recent research.

FINDINGS
TENSORFLOW
- TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems Mar 2016 
This is the paper for TensorFlow described in the Google Research Blog article "Supercharge your Computer Vision models with the TensorFlow Object Detection API."

- TensorFlow Estimators: Managing Simplicity vs. Flexibility in High-Level Machine Learning Frameworks Aug 2017
This TensorFlow paper provides comparisons of various aspects of machine learning.

SSD
- SSD: Single Shot MultiBox Detector Dec 2016 
This link was included in the Google Research Blog article "Supercharge your Computer Vision models with the TensorFlow Object Detection API."

INCEPTION
- Rethinking the Inception Architecture for Computer Vision Dec 2015 
This is an earlier paper about Inception which is described in the Medium article "An Intuitive Guide to Deep Network Architectures."

- Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning Aug 2016  
This is the more recent paper about Inception which is described in the Medium article "An Intuitive Guide to Deep Network Architectures."

RESNET
- Deep Residual Learning for Image Recognition Dec 2015 
This is the ResNet paper described in the Medium article "An Intuitive Guide to Deep Network Architectures."

FASTER R-CNN
- Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks Jan 2016 
This is the paper for Faster R-CNN which is described in the Google Research Blog article "Supercharge your Computer Vision models with the TensorFlow Object Detection API."

- Speed/accuracy trade-offs for modern convolutional object detectors Apr 2017 
This link was included in the Google Research Blog article "Supercharge your Computer Vision models with the TensorFlow Object Detection API"; the abstract says it is meant "to serve as a guide for selecting a detection architecture that achieves the right speed/memory/accuracy balance."

FULLY CONVOLUTIONAL NETWORKS
- Xception: Deep Learning with Depthwise Separable ConvolutionsApr 2017 
This is the Xception paper described in the Medium article "An Intuitive Guide to Deep Network Architectures."

- Towards a New Interpretation of Separable Convolutions 2017 
This references the other Xception paper but draws separate conclusions.

- MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications Apr 2017 
This is the paper for MobileNets which is referenced in the Google Research Blog article "Supercharge your Computer Vision models with the TensorFlow Object Detection API."

- Max-Margin Object Detection Jan 2015 
This paper is about Max-Margin Object Detection and includes the Histogram of Oriented Gradients (HOG) and sliding window framework described in the Medium article "You Only Look Twice — Multi-Scale Object Detection in Satellite Imagery With Convolutional Neural Networks (Part I)" but concludes better results were obtained with the MMOD than without.

- Convolutional neural network architecture for geometric matching Apr 2017 
This paper deals with the convolutional neural network and geometric matching.

- Irregular Convolutional Neural Networks Jun 2017 
This paper deals with Irregular Convolutional Neural Networks.

- Deformable Convolutional Networks June 2017 
This paper deals with deformable convolutional networks

- Going Deeper with Convolutions Sep 2014 
This paper deals with convolutional neural networks, particularly Inception, as well as GoogLeNet.

- DSSD : Deconvolutional Single Shot Detector Jan 2017 
This paper deals with improvements to object detection, particularly involving small objects which I believe relates to You Only Look Twice.

- ImageNet Classification with Deep Convolutional Neural Networks 
This is the paper for AlexNet.

- One weird trick for parallelizing convolutional neural networks Apr 2014 
This paper does not have a lengthy abstract but is by the same primary author as that of AlexNet and deals with improved training of CNNs.

- Very Deep Convolutional Network for Large-Scale Image Recognition 2015 
This paper includes information on VGG-16.

- Fully Convolutional Networks for Semantic Segmentation May 2016 
Paper on fully convolutional networks.

CRFS
- Structured Image Classification from Conditional Random Field with Deep Class Embedding May 2017 
This is the most recent paper I found which discussed conditional random fields (CRF), included in the client's "Architectures" summary.

- Amortized Inference and Learning in Latent Conditional Random Fields for Weakly-Supervised Semantic Image Segmentation May 2017
This paper deals with CRFs and semantic image segmentation.

- 2D-3D Pose Consistency-based Conditional Random Fields for 3D Human Pose Estimation Apr 2017
This is an additional paper concerning CRFs.

- Conditional Random Fields as Recurrent Neural Networks Apr 2016 
Another paper concerning CRFs.

- Higher Order Conditional Random Fields in Deep Neural Networks Jul 2016 
A further paper concerning CRFs.

GAN
- Generative Adversarial Networks Jun 2014 
This paper gives an overview of Generative Adversarial Networks.

- Improved Techniques for Training GANs Jun 2016 
This paper is about General Adversarial Models.

MISCELLANEOUS
-Microsoft COCO: Common Objects in Context 
This paper is about the COCO dataset, used in the detection models.

- You Only Look Once: Unified, Real-Time Object Detection May 2016 
This paper is on the topic of a YOLO (You Only Look Once) model.

- Image-to-Image Translation with Conditional Adversarial Networks Nov 2016 
This paper discusses "pix-to-pix".

- GitHub 
This is the GitHub page for the TensorFlow model zoo

- ImageNet Large Scale Visual Recognition Challenge Jan 2015 
This is a paper about the annual ImageNet competition and the advancements in object recognition which have resulted from it.

- Real-Time Facial Segmentation and Performance Capture from RGB Input Apr 2016 
This paper includes information on EDeconvNet which is included in the client's "Architectures" summary.

- Comparing the Performance of L*A*B* and HSV Color Spaces
with Respect to Color Image Segmentation Feb 2015
This paper deals with image processing, particularly HSV

- Learning non-maximum suppression May 2017 
This paper deals with "non-max suppression".

CONCLUSION
Overall I have found that the most recent developments in state-of-the-art machine learning (ML) systems, and deep learning models, for computer vision detection and classification are based on fully convolutional models.  I have been able to provide links to 36 examples of recent research, and have arranged them into categories.

Thanks for using Wonder, please let us know if there's anything else that we can help you with!

Wonder

What are the most recent developments in state-of-the-art machine learning (ML) systems, and deep learning models, for computer vision detection and classification? Specifically the architectures.

Delivered August 21st, 2017

What are the most recent developments in state-of-the-art machine learning (ML) systems, and deep learning models, for computer vision detection and classification? Specifically the architectures.

Did this report spark your curiosity?

[1512.03385] Deep Residual Learning for Image Recognition

[1602.07261] Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning

[1512.00567] Rethinking the Inception Architecture for Computer Vision

[1610.02357] Xception: Deep Learning with Depthwise Separable Convolutions

Towards a New Interpretation of Separable Convolutions

[1603.04467] TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems

[1512.02325] SSD: Single Shot MultiBox Detector

[1704.04861] MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications

[1506.01497] Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

[1611.10012] Speed/accuracy trade-offs for modern convolutional object detectors

Microsoft COCO: Common Objects in Context

[1506.02640] You Only Look Once: Unified, Real-Time Object Detection

[1611.07004] Image-to-Image Translation with Conditional Adversarial Networks

[1606.03498] Improved Techniques for Training GANs

[1406.2661] Generative Adversarial Networks

tensorflow/models

[1409.0575] ImageNet Large Scale Visual Recognition Challenge

Max-Margin Object Detection

[1703.05593] Convolutional neural network architecture for geometric matching

[1706.07966] Irregular Convolutional Neural Networks

[1703.06211] Deformable Convolutional Networks

[1409.4842] Going Deeper with Convolutions

[1701.06659] DSSD : Deconvolutional Single Shot Detector

ImageNet Classification with Deep Convolutional Neural Networks

[1404.5997] One weird trick for parallelizing convolutional neural networks

VERY DEEP CONVOLUTIONAL NETWORKS FOR LARGE-SCALE IMAGE RECOGNITION

Real-Time Facial Segmentation and Performance Capture from RGB Input

[1705.07420] Structured Image Classification from Conditional Random Field with Deep Class Embedding

[1705.01262] Amortized Inference and Learning in Latent Conditional Random Fields for Weakly-Supervised Semantic Image Segmentation

[1704.03986] 2D-3D Pose Consistency-based Conditional Random Fields for 3D Human Pose Estimation

[1502.03240] Conditional Random Fields as Recurrent Neural Networks

[1511.08119] Higher Order Conditional Random Fields in Deep Neural Networks

[1605.06211] Fully Convolutional Networks for Semantic Segmentation

Comparing the Performance of L*A*B* and HSV Color Spaces with Respect to Color Image Segmentation

[1705.02950] Learning non-maximum suppression

[1708.02637] TensorFlow Estimators: Managing Simplicity vs. Flexibility in High-Level Machine Learning Frameworks

Comparing the Performance of LAB* and HSV Color Spaces with Respect to Color Image Segmentation