Ringo Chu

Aloha my name is Ringo and welcome to my page! I am a pre-doctoral researcher at SenseTime Research. Previously I completed my undergrad degree in Computer Science at University College London. My research lies on computational imaging, scene understanding (e.g visual tracking/relation), and embedded ML sys. I was a research intern with Intel, where I worked on Sim2Real Robotics and probabilistic finance models. Before that, I was a robotic research intern at Dyson working on monocular visual odometry for the Eye 360. Prior to that, I was a renowned researcher at Imperial College London Custom Computing, where I worked on fast 3D Convolution neural networks for NASA AVRIS imaging and inference acceleration.

Apart from being a nerd, I am a bookworm, a tennis player. I also collect cassette tapes. If you want to know more about me, you can look at my Google Scholar, Facebook, LinkedIn , my resume or even contact me in person. Even though I am temporarily away, I remain reachable and always keen to chat and answer your queries :)

Publication and Projects

Below is the collection of my researches and some of my interesting projects! (That can be displayed in the public)

Visual Relation Learning through Transformer Network


My Current Project with LUKADVISOR investigating the use of Transformer to solve Visual Relation Detection. Visual relation is by far one of the hardest vision-language tasks to solve as it combines vision and language models to reason an image. Traditionally, VRD is tackled by constructing graphs or guided by language models. Motivated by the recent success of Transformer in linguistic and vision tasks, this project aims to use the transformer architecture as a Look-Up-Table to directly predict the predicate(relation) between subject-object pairs.

(A Closer Look at) Neural RGB+𝔻 Object Tracking with Depth Estimation


This project introduces the first neural approach to support 2D tracking using approximated depth information. We assume that objects should have small displacement and small change in depth displacement between frames(constant velocity assumption), and apply this assumption to a visual object tracker and a multiple object tracker using depth information generated from a monocular depth estimation network. Based on preliminary quantitative results, the proposed approach has shown promise in occlusion detection for tracking of single and multiple generic objects.

Quantization Sim2Real Robotic Arm Control



Ho-Cheung Ng, Steven Liu, Izaak Coleman, Ringo Chu, Man-Chung Yue, Wayne Luk
The International Conference on Field-Programmable Technology, 2020
[Paper] [Bibtex]

Abstract:Recent advancements in the throughput of next generation sequencing machines pose a huge computational challenge in analyzing the massive quantities of sequenced data produced. A critical initial step of genomic data analysis is short read alignment, which is a bottleneck in the analysis workflow. This paper explores the use of a reconfigurable architecture to accelerate this process, based on the seed-and-extend model of Bowtie2. In the proposed approach, complete information available in sequencing data is integrated into an FPGA alignment pipeline for biologically accurate runtime acceleration. Experimental results show that our architecture achieves a similar alignment rate compared to Bowtie2, mapping reads around twice as fast. Particularly, the alignment time is reduced from 50 minutes to 26 minutes when processing 300M reads.


Martin Ferianc, Hongxiang Fan, Ringo S.W. Chu, Jakub Stano, Wayne Luk
Applied Reconfigurable Computing (ARC), 2020
[Paper] [Bibtex]

Featured here!!! -> this
Abstract: A method for exploring the design space and latency prediction for FPGAs using a Gaussian process parametrized by an analytic approximation and coupled with runtime data. Experiments were conducted on three CNNs accelerator targeting Intel Arria 10 GX 1150, demonstrating a 30.7% improvement in accuracy w.r.t MAE against a standard analytic method in leave-one-out cross-validation.


Ringo Chu, Ho-Cheung Ng, Xiwei Wang, Wayne Luk
IEEE Symposium On Geoscience and Remote Sensing (IGARSS), 2019
[Paper] [Code] [Bibtex]

A new deep-learning model to extract 3D features at spatial and spectral levels simultaneously to address the classic problem of high dimensionality on HSIs. Experiments were conducted using scenes acquired by NASA Airborne Visible/Infra-Red Imaging Spectrometer and have shown competitive performance.

{Steven Liu*, Ringo Chu*}, Xiwei Wang, Wayne Luk
Applied Reconfigurable Computing (ARC), 2019
[Paper] [Code] [Bibtex]

An FPGA accelerator for hyperspectral imagery with an optimized architecture that is faster than GPUs and consume less power. The proposal method is suitable and ideal for real time inference deployment.

Multiple Agents Artificial Stock Market Modelling


A research project to model a stock market with agents of many behaviours. A probabilistic method is used to analyse the stochasticity of processes and collectives of agents. The model is also benchmarked on Intel Xeon multi-core CPU and Altera FPGA.

LSTM verses Gaussian Process Regression


Me, 2018 Summer
Short Summary

A very very very brief comparison between Long-Short term memory and Gaussian Process Regression on stock predictions based on their interpretabilities.

Mindstorm Robots


My childhood, and then recently

First prototype born in 2015(old photo and rare footage). This is a 35cm x 35cm x 40cm sized robot built in Lego only! The robot was initially constructed with a Sony phone attached as an explorer and as a Mars rover capable of broadcasting live motions. Later I experimented some Reinforcement learning algorithms with the robot for solving maze and learning to escape from being stuck.

[email protected]