Data management challenges in production machine learning

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 299
  • Download : 0
The tutorial discusses data-management issues that arise in the context of machine learning pipelines deployed in production. Informed by our own experience with such largescale pipelines, we focus on issues related to understanding, validating, cleaning, and enriching training data. The goal of the tutorial is to bring forth these issues, draw connections to prior work in the database literature, and outline the open research questions that are not addressed by prior art.
Publisher
ACM Special Interest Group on Management of Data (SIGMOD)
Issue Date
2017-05
Language
English
Citation

2017 ACM SIGMOD International Conference on Management of Data, SIGMOD 2017, pp.1723 - 1726

DOI
10.1145/3035918.3054782
URI
http://hdl.handle.net/10203/241396
Appears in Collection
EE-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0