Abstract: Recently, deep structured networks such as deep convolutional (CNN) and recurrent (RNN) neural networks have enjoyed great success. However, similar to most machine learning techniques, current deep learning approaches rely on conventional statistics and restricted to a specific problem formulation. In particular, they are designed to learn a model for a distribution (or a function) that maps a structured input, typically a vector, matrix, or tensor, to a structured output. Due to this limitation, deep learning has not had any success or significant impact in some applications. In fact, many problems such as object detection, graph problems, and multi-target tracking, are naturally expressed with sets of elements rather than vectors. As opposed to a vector, the size of a set is not fixed in advance, and it is invariant to the ordering of its elements. Therefore, learning approaches built based on conventional statistics cannot be used directly for these problems. In the talk, I will discuss deep learning approaches that use finite set statistics and point processes instead of conventional statistics, as a step towards the set learning problem. I will also highlight further our recent work.
Dr. Hamid Rezatofighi is a Senior Research Fellow at the Australian Centre for Visual Technologies (ACVT) at the University of Adelaide, and an Associated Research Fellow with the Australian Centre for Robotic Vision. He received his PhD from the Australian National University in 2015. Since 2011, he has been actively working in the field of object detection and multiple object tracking with research expertise in Bayesian filtering, estimation and learning using point process and finite set statistics. He has applied his expertise on many different, but closely related applications in computer vision, medical imaging, robotics and defence. He has recently started working on an emerging field of study in machine learning, known as set learning.