Ziliang Zong - Research

My current and previous research/education projects have covered a broad range of topics, which include energy-efficient computing and systems, parallel computing, big data analytics, mobile computing, AI, and edge computing.

Deep artificial neural network technology has been widely used to solve many challenging tasks in computer vision, natural language processing, speech recognition, and more. Most of today's deep learning algorithms are designed for high performance servers and running in the cloud. As the edge devices (e.g., mobile phones and smart watches) become more capable and the advantages of on-device artificial intelligence (AI) (e.g. protecting privacy, working without a network, processing data locally in real-time) become more evident, bringing AI to the edge will be inevitable. However, the limited resources (e.g., computation, memory, and battery) of edge devices bring a whole new level of challenges: (1) On-device AI must keep the model size small without sacrificing accuracy; (2) On-device AI must keep the power usage low; (3) Future on-device AI should enable efficient processing and analysis on multi-modal data (e.g., video, audio, and text); and (4) On-device AI should be interpretable and reproducible. This project aims to address these challenges by (1) exploring innovative machine learning algorithms (e.g., multi-task learning) for multi-modal data analysis; (2) exploring multi-modal pruning algorithms (reducing the neural network size without compromising accuracy) that can be applied on edge devices; (3) investigating and explaining how pruning works and using the derived theory to guide further pruning optimization; and (4) improving the energy efficiency of on-device AI algorithms and developing energy-aware scheduling algorithms for on-device AI apps.

Excessive energy consumption is a major constraint when designing and deploying the next generation of supercomputers. Minimizing energy consumption of high performance computing requires novel energy-conscious technologies at multiple layers from architecture, system support, and applications. One obstacle that hinders the exploration of these new technologies is the lack of tools and systems that can provide accurate, fine-grained, and real-time power and energy measurement for technology evaluation and verification.

This project bridges the gap by building Marcher, a heterogeneous high performance computing infrastructure equipped with cutting-edge power-efficient accelerators including Intel Many Integrated Cores and Nvidia Graphics Processing Units, power-aware memory systems, hybrid storage with hard disk drives and solid state disks, and high performance interconnects. The Marcher system supports the development of two complementary component-level power measurement tools for major computer components: (i) pluggable Power Data Acquisition Card (PODAC) for direct and decomposed power measurement and (ii) Software Power Meter (SoftMeter) that indirectly estimates the power consumption of systems where direct measurement is not feasible or too costly.

This project has succesfully completed and Marcher is available to a broader community and researchers to conduct research and education in green computing. Contact us if you want to post your research blogs at GreenSoft . Try GreenCode, the cloud-based IDE that compiles and runs programs written in more than 20 languages and reports the performance and energy consumption your code.

(PI) FastStor: Data-Mining-Based Multilayer Prefetching for Hybrid Storage Systems (Funded by NSF, Collaborate with Auburn University and Texas A&M University-Kingsville, $169,816, 09/2011 ~ 11/2014)

A large number of existing parallel storage systems consist of hybrid storage components, including solid-state drives (SSD), hard disks (HDD), and tapes. Compared with high-speed storage components (e.g. SSD and HDD), tapes inevitably become an I/O performance bottleneck. Prefetching and caching are commonly employed techniques to boost I/O performance by increasing the data hitting rate of high-end storage components. However, prefetching in the context of hybrid storage systems is technically challenging due to an interesting dilemma: aggressive prefetching schemes can efficiently reduce I/O latency, whereas overaggressive schemes may waste I/O bandwidth by transferring useless data from HDDs to SSDs or from tapes to HDDs. In this research project, we will investigate new data-mining-based multilayer prefetching techniques to improve performance of hybrid storage systems. The goals of this research are to (1) design data-mining algorithms for multilayer prefetching; (2) develop predictive parallel prefetching mechanism for SSD-based storage systems; (3) implement parallel data transfer among SSDs, HDDs, and tapes; (4) develop meta-data management schemes; and (5) implement a simulation framework named FastStor-SIM. The developed toolkit can be used to improve the I/O performance of data centers with hybrid storage systems.

Visualization is significantly changing the way we view spatial data and discover information. On the one hand, a large number of spatial data, which carry extremely valuable information, are generated every day. On the other hand, these data are not well utilized due to the lack of free and easily used data visualization tools. This becomes even worse when most of the spatial data remains in the form of plain text such as log files. This research aims at exploring the possibility of visualizing massive plain-text spatial data at no cost by utilizing publically available visualization tools like Google Earth. We illustrate our methods by visualizing over 990,000 global download requests for satellite images maintained by USGS EROS. The developed VisEROS web portal is able to visualize the properties of global download requests for the satellite images and analyze the hidden download patterns. This web portal allows users to intuitively access EROS global satellite image download requests data that can be visualized with Google Earth. This research demonstrates an easy way to visualize massive textual spatial data, which is highly applicable to mining spatially referenced data and information on a wide variety of research domains (e.g. hydrology, agriculture, atmospheric science, natural hazard and global climate change). The developed visualization techniques have attracted immediate attention, including a number of publications and two invited talks at the USGS EROS and the DOE Pacific Northwest National Laboratory.

(Lead PI) EEDAG: Exploring Energy-Efficient Parallel Tasks Generation and Scheduling for Heterogeneous Multicore Systems (Funded by NSF, Collaborate with Marquette University and UC-Riverside, 09/2011 ~ 08/2013, $195,731)

This project addresses optimizing energy efficiency in the execution of parallel algorithms. High energy cost is a salient constraint when running large scale parallel applications on the next generation of supercomputers that contain heterogeneous multicore processors and interconnections, motivating a rethinking of conventional approaches to modeling, designing and scheduling parallel tasks by taking energy-efficiency into consideration. In this project, we collaborate with Marquette University and UC-Riverside to explore energy-efficient parallel task design and scheduling as well as develop a power profiling tool that can measure decomposed runtime power consumption of different computing components (e.g. processors, memory, networks and disks).

Mobile technologies are ubiquitous in modern society, and college students find themselves increasingly immersed in their use. Students use devices like smart phones and iPods to network with friends, view and share pictures and videos, access specialized "apps," among myriad other activities. As a result, instructors find their students, distracted by their mobile devices, are not paying attention in. Is it possible to integrate this hand-held universe of mobile technology into the STEM classroom so that it becomes an organic, integral part of the learning experience rather than a 21st-century distraction from it? Can mobile technology be used to enhance and improve the learning experience of students? We attempt to do so at the South Dakota School of Mines and Technology, by developing a mobile portal through which instructors and students can access mobile content to supplement and expand the in-class learning environment. This educational project will address both the pedagogical considerations of the M-Learning portal, such as the choice of mobile content and the challenge of integrating its use naturally in the classroom, as well as the technological issues of it, such as its design and scalability. It will also provide initial results assessing students' learning efficiency in the mobile portal environment and evaluating how successfully the M-Learning Portal enhances student learning.

Overview

Current Research Projects

Completed Research Projects