Qian Ge

I am an Electrical Engineering Ph.D. candidate at North Carolina State University. I am working with Dr. Edgar Lobaton at Active Robotic Sensing (ARoS) Laboratory on computer vision and machine learning. My current research mainly focuses on robust image segmentation for different applications, including obstacle detection for autonomous cars, topology preserving segmentation for learning the shell structure of small organisms, and consensus-based natural image segmentation. I also like digging into various deep learning models by implementing models and reproducing experiments of state-of-the-art research.

Education

Aug 2019	Ph.D. in Electrical Engineering North Carolina State University, Raleigh, NC, USA
June 2011	M.S. in Electrical Engineering University of Electronic Science and Technology of China, Chengdu, P.R. China
July 2008	B.S. in Electrical Engineering University of Electronic Science and Technology of China, Chengdu, P.R. China

Skills

Computer Languages	Python, MATLAB, C/C++
Frameworks/Toolbox	TensorFlow, NumPy, Scikit-learn, OpenCV, Git
Professional	Computer Vision, Image Segmentation and Classification, Object Detection, Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), Generative Adversarial Networks (GANs), Visual Attention Models

Work Experience

Jan 2018 -- May 2018

Research Aid Intern, Argonne National Laboratory, Lemont, IL
Primarily worked on validation of power system models and applying machine learning algorithms to learn load models. Designed a RNN based load demand forecasting model and achieved state-of-the-art performance.
[details]

Open Source Projects

	YOLOv3 for Object Detection TensorFlow implementation of YOLOv3 object detection for both inference and training. A ready-to-use pre-trained model converted from official implementation were provided (80 object classes trained on COCO dataset). [details][code] Provided an instruction for converting trained model stored in official .weigth file to .npy file. Provided customized training blocks, including bounding box clustering, data augmentation and multi-scale training. Trained on PASCAL VOC dataset for 20 object classes detection on natural images.
	Person Re-Identification with Triplet Loss TensorFlow implementation of person re-identification using triplet loss with batch hard mining training strategy. Re-ranking was used during person image retrieval. [code]
	Implementations of Generative Adversarial Networks (GANs) Implementions of various GANs models for comparison and testing the training behaviors of different GANs. Applied on MNIST dataset and CelebA human face dataset. [details][code] Currently, DCGAN, LSGAN and InfoGAN are implemented and sucessfully trained on both dataset. Generated face images with controlled context, such as emotion, hairstyle and azimuth, in an unsupervised manner by using InfoGAN.
	Adversarial Autoencoders for Variational Inference and Semi-Supervised Learning Provided an implemented of adversarial autoencoders (AAE) which utilize the GAN framework as a variational inference algorithm. Applied for semi-supervised learning and disentangling style and content of images. [code]
	Image to Image Translation with Conditional GANs Reconstructed building facade photos from label maps and generated shoes photos from sketches using pix2pix conditional GANs. [code]
	Visualization CNN for Interpretation of Trained Models Provided interpretation of trained CNN models by visualizing the learned features and the image regions where the models pay attention to. [details][code] Features were visualized through Transposed convolutional network and guided back propagation Attention regions were visualized through Class Activation Mapping and Gradient-weighted Class Activation Mapping.
	Image Classification using Recurrent Attention Model Implementation of recurrent visual attention model for image classification. This model reduces the computational complexity by only focusing on a sequence of small regions of the image, which is controlled by a RNN. [details][code] Achieved 97.82% accuracy on 60 x 60 translated MNIST. Provided interpretation of the classification results by visualizing the attention regions during inference.
	VGG and GoogleNet for Image Classification and Feature Extraction Implemented VGG and GoogleNet (Inceptionv1) image classification for training, inference and feature extraction. [details][VGG code] [GoogleNet code] Modified VGG to a fully convolutional network by using a global average pooling layer to accept arbitrary size of images during inference. Designed a modified Inception network for training on low resolution dataset from scratch (achieved 93.64% accuracy on CIFAR-10 testing set).
	Image and Video Style Transfer using Fast Style Transfer and Neural Style Implemented the fast style transfer to transfer images and videos to a specific artistic style in nearly real-time, and implemented the neural style transfer for image style transfer. [fast style code] [neural style code]

Research Experience

	A Visual System for Autonomous Foraminifera Identification Foraminifera are single-celled organisms with shells which are useful in petroleum exploration, biostratigraphy, paleoecology and paleobiogeography. We developed an automated system for identification of foraminifera species to reduce the human efforts on manually picking thousands of samples from ocean sediments. We also created a foraminifera image dataset and proposed novel robust edge detection algorithms on this dataset. [details] [project page] Leaded the creation of a foraminifera image dataset containing 1437 samples and 457 manually segmentation samples. Created synthetic images refined by GANs for data augmentation. Developed a coarse-to-fine fully convolutional edge detection network which iteratively applies edge detection modules on predicted edge maps. Achieved 0.91 edge F1 score on the foraminifera dataset for finding vague edges between foraminifera chambers with similar texture. Designed a topology-based metric to measure the structural difference between two edge maps. Developed a training process utilizing the topological metric to train an edge detection network which focuses on preserving topological structures of edges. Improved edge F1 score from 0.91 to 0.93 and segmentation IoU from 0.80 to 0.82. Built a transfer learning process for identification of six foraminifera species using features extracted from pre-trained VGG, Inception and ResNet.
	Robust Traffic Scenes Obstacle Detection and Image Segmentation We proposed a persistent homology based image segmentation framework which is robust to image qualities and parameter selection. The application areas for this framework include autonomous driving systems and segmentation of natural and biological images. [details] [presentation] Proposed a persistent homology based image segmentation framework which is robust to image qualities and parameter selection. Designed a pipeline for traffic scene obstacle detection based on this framework by extracting persistence regions in occupancy grids computed from disparity maps. Demonstrated that the traffic scene obstacle detection pipeline is robust to input image quality through experiments on KITTI dataset. Designed a consensus-based image segmentation based on this framework for robustly extracting consensus information from a segmentation result set generated by different segmentation algorithms. Achieved better performance over a wide range of parameters than any input algorithm with its best parameter setting on Berkeley Segmentation Database.
	Exploring Victorian Illustrated Newspaper Data through Computer Vision Techniques The aim of this project is to explore how computer vision and image processing techniques can be adapted for large-scale interpretation of historical materials. We applied several computer vision techniques on a set of nineteenth-century illustrated British newspapers to test the feasibility of these techniques for analyzing large collections of historical illustrations. [details] [project page] Created a Victorian newspaper illustration dataset by extracting illustration regions from scanned newspaper pages with high accuracy. Developed a Fourier transform based feature to distinguish line engravings and halftone images for tracking the presence of halftone images in late nineteenth-century British newspapers. Extracted specific scenes such as portraits, crowds, buildings and weather charts using k-means, KNN and SVM based on GIST descriptor.
	Non-Rigid Image Registration with Uncertainty Analysis We proposed a novel non-rigid image registration methodology which can be applied to medical images as well as natural images. We also provided the uncertainty bounds to characterize the registration accuracy over the entire image domain. [details] [poster] Developed a topological-based correspondence point matching algorithm under a Lipschitz non-rigid deformation with zero false negative rate and high precision. Extended the point matching to region registration by solving a graph matching problem with geometric constraints. Developed an approach to quantify the uncertainty of the region registration.

Publications

Automated species-level identification of planktic foraminifera using convolutional neural networks, with comparison to human performance
R. Mitra, T. Marchitto, Q. Ge, B. Zhong, B. Kanakiya, M.S. Cook, J.S. Fehrenbacher, J.D. Ortiz, A. Tripati and E. Lobaton
Marine Micropaleontology 2019
[1] [abs] [link]

Image Analytics and the Nineteenth-Century Illustrated Newspaper
P. Fyfe and Q. Ge
Journal of Cultural Analytics 2018
[2] [link]

Obstacle Detection in Outdoor Scenes based on Multi-Valued Stereo Disparity Maps
Q. Ge and E. Lobaton
IEEE Symp. Series Comput. Intell. (SSCI) 2017
[3] [abs] [pdf]

Coarse-to-Fine Foraminifera Image Segmentation through 3D and Deep Features
Q. Ge, B. Zhong, B. Kanakiya, R. Mitra, T. Marchitto, and E. Lobaton
IEEE Symp. Series Comput. Intell. (SSCI) 2017
[4] [abs] [pdf]

A Comparative Study of Image Classification Algorithms for Foraminifera Identification
B. Zhong, Q. Ge, B. Kanakiya, R. Mitra, T. Marchitto, and E. Lobaton
IEEE Symp. Series Comput. Intell. (SSCI) 2017
[5] [abs] [pdf]

Consensus-Based Image Segmentation via Topological Persistence
Q. Ge and E. Lobaton
IEEE Conf. on Comput. Vis. Pattern Recognit. Workshops (CVPRW) 2016
[6] [abs] [pdf]

Robust Multi-Target Tracking in Outdoor Traffic Scenarios via Persistence Topology based Robust Motion Segmentation
S. Chattopadhyay, Q. Ge, C. Wei, and E. Lobaton
IEEE Global Conf. Signal Inf. Process. (GlobalSIP) 2015
[7] [abs] [pdf]

Robust Obstacle Segmentation based on Topological Persistence in Outdoor Traffic Scenes
C. Wei, Q. Ge, S. Chattopadhyay, and E. Lobaton
IEEE Symp. Series Comput. Intell. (SSCI) 2014
[8] [abs] [pdf]

Manifold Learning Approach to Curve Identification with Applications to Footprint Segmentation
N. Lokare, Q. Ge, W. Snyder, Z. Jewell, S. Allibhai, and E. Lobaton
IEEE Symp. Series Comput. Intell. (SSCI) 2014
[9] [abs] [pdf]

Non-Rigid Image Registration under Non-Deterministic Deformation Bounds
Q. Ge, N. Lokare, and E. Lobaton
10th International Symposium on Medical Information Processing and Analysis 2014
[10] [abs] [pdf]

Teaching Experience

F2018	Neural Networks (NCSU ECE 542), TA
F2016	Applications of Graphs and Graphical Models (NCSU ECE/CSC 792), TA
F2015	Computer Systems Programming (NCSU ECE 209), TA

Last updated on 2019-01-28