A data as a product model for future consumption of big stream data in clouds
conference contribution
posted on 2023-05-23, 12:07authored byHuang, G, He, J, Chi, C, Zhou, W, Zhang, Y
Data is becoming the world's new natural resource and big data use grows quickly. The trend of computing technology is that everything is merged into the Internet and 'big data' are integrated to comprise complete information for collective intelligence. With the increasing size of big data, refining big data themselves to reduce data size while keeping critical data (or useful information) is a new approach direction. In this paper, we provide a novel data consumption model, which separates the consumption of data from the raw data, and thus enable cloud computing for big data applications. We define a new Data-as-a-Product (DaaP) concept, a data product is a small sized summary of the original data and can directly answer users' queries. Thus, we separate the mining of big data into two classes of processing modules: the refine modules to change raw big data into small sized data products, and application-oriented mining modules to discover desired knowledge further for applications from well-defined data products. Our practices of mining big stream data, including medical sensor stream data, streams of text data and trajectory data, demonstrated the efficiency and precision of our DaaP model for answering users' queries.
History
Publication title
Proceedings of 2015 IEEE International Conference on Services Computing
Editors
PP Maglio, I Paik, W Chou
Pagination
256-263
Department/School
School of Information and Communication Technology
Publisher
Institute of Electrical and Electronics Engineers, Inc.
Place of publication
Piscataway, NJ, United States
Event title
2015 IEEE International Conference on Services Computing
Event Venue
New York City, New York, United States
Date of Event (Start Date)
2015-06-27
Date of Event (End Date)
2015-07-02
Rights statement
Copyright 2015 IEEE
Repository Status
Restricted
Socio-economic Objectives
Other information and communication services not elsewhere classified