The Third International Workshop on Big Data Reduction

held with 2022 IEEE International Conference on Big Data

Branching

Introduction

Today’s modern applications are producing too large volumes of data to be stored, processed, or transferred efficiently. Data reduction is becoming an indispensable technique in many domains because it can offer a great capability to reduce the data size by one or even two orders of magnitude, significantly saving the memory/storage space, mitigating the I/O burden, reducing communication time, and improving the energy/power efficiency in various parallel and distributed environments, such as high-performance computing (HPC), cloud computing, edge computing, and Internet-of-Things (IoT). An HPC system, for instance, is expected to have a computational capability of floating-point operations per second, and large-scale HPC scientific applications may generate vast volumes of data (several orders of magnitude larger than the available storage space) for post-anlaysis. Moreover, runtime memory footprint and communication could be non-negligible bottlenecks of current HPC systems.

Tackling the big data reduction research requires expertise from computer science, mathematics, and application domains to study the problem holistically, and develop solutions and harden software tools that can be used by production applications. Specifically, the big-data computing community needs to understand a clear yet complex relationship between application design, data analysis and reduction methods, programming models, system software, hardware, and other elements of a next-generation large-scale computing infrastructure, especially given constraints on applicability, fidelity, performance portability, and energy efficiency. New data reduction techniques also need to be explored and developed continuously to suit emerging applications and diverse use cases.

There are at least three significant research topics that the community is striving to answer: (1) whether several orders of magnitude of data reduction is possible for extreme-scale sciences; (2) understanding the trade-off between the performance and accuracy of data reduction; and (3) solutions to effectively reduce data size while preserving the information inside the big datasets.

The goal of this workshop is to provide a focused venue for researchers in all aspects of data reduction in all related communities to present their research results, exchange ideas, identify new research directions, and foster new collaborations within the community.

Please note this year’s IEEE BigData conference and IWBDR workshop will be held online. Proceedings of the workshop will be published as planned. We will provide more details about how to attend this workshop virtually.

Submissions

Topics of Interest

The focus areas for this workshop include, but are not limited to:

Proceedings

All papers accepted for this workshop will be published in the Workshop Proceedings of IEEE Big Data Conference, made available in the IEEE eXplore digital library.

Submission Instructions

Important Dates

Organizers

Program Chairs

Web Chair

Program Committee (Planned)

Program Schedule

Date: December 18, 2022

Timezone: Japan Time (JST), UTC+9

Time Title
2:00 - 2:05 pm Opening Remarks and Welcome
  Dingwen Tao, Xin Liang, Sheng Di
2:05 - 3:00 pm Invited Talk: Compressed Data Direct Processing for Big Data Reduction
  Feng Zhang, Associate Professor, Renmin University
3:00 - 3:20 pm S11201: Machine Learning Platform for Extreme Scale Computing on Compressed IoT Data.
  Seshu Tirupathi, Dhaval Salwala, Giulio Zizzo, Ambrish Rawat, Mark Purcell, Søren Kejser Jensen, Christian Thomsen, Nguyen Ho, Carlos E. Muniz Cuza, Jonas Brusokas, Torben Bach Pedersen, Giorgos Alexiou, Giorgos Giannopoulos, Panagiotis Gidarakos, Alexandros Kalimeris, Stavros Maroulis, George Papastefanatos, Ioannis Psarros, Vassilis Stamatopoulos, and Manolis Terrovitis
3:20 - 3:40 pm S11202: Lossy Compression to Reduce Latency of Local Image Transfer for Autonomous Off-Road Perception Systems
  Max Faykus, Bradley Selee, Jon Calhoun, and Melissa Smith
3:40 - 4:00 pm BigD616: Estimating Potential Error in Sampling Interpolation
  Megan Hickman Fulp, Dakota Fulp, and Jon Calhoun
4:00 - 4:10 pm Coffee Break
4:10 - 4:30 pm S11204: Towards Guaranteeing Error Bound in DCT-based Lossy Compression
  Jiaxi Chen, Aekyeung Moon, and Seung Woo Son
4:30 - 4:50 pm S11203: Exploring Data Corruption Inside SZ
  Ruiwen Shan and Jon C. Calhoun
4:50 - 5:10 pm BigD247: Extraction of Power Consumption Patterns using Non-negative Tucker Decomposition
  Taku Moriyama, Mio Hosoe, Masashi Kuwano, and Yuka Minamino
5:10 - 5:15 pm Closing Remarks

Participation

Office website: https://events.rdmobile.com/Sessions/Details/1638858