We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.OT

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Other Statistics

Title: A framework for understanding data science

Abstract: The objective of this research is to provide a framework with which the data science community can understand, define, and develop data science as a field of inquiry. The framework is based on the classical reference framework (axiology, ontology, epistemology, methodology) used for 200 years to define knowledge discovery paradigms and disciplines in the humanities, sciences, algorithms, and now data science. I augmented it for automated problem-solving with (methods, technology, community). The resulting data science reference framework is used to define the data science knowledge discovery paradigm in terms of the philosophy of data science addressed in previous papers and the data science problem-solving paradigm, i.e., the data science method, and the data science problem-solving workflow, both addressed in this paper. The framework is a much called for unifying framework for data science as it contains the components required to define data science. For insights to better understand data science, this paper uses the framework to define the emerging, often enigmatic, data science problem-solving paradigm and workflow, and to compare them with their well-understood scientific counterparts, scientific problem-solving paradigm and workflow.
Comments: 28 pages, 10 figures
Subjects: Other Statistics (stat.OT); Methodology (stat.ME)
ACM classes: I.2.0; I.2.8; E.0
Cite as: arXiv:2403.00776 [stat.OT]
  (or arXiv:2403.00776v1 [stat.OT] for this version)

Submission history

From: Michael Brodie [view email]
[v1] Wed, 14 Feb 2024 15:55:40 GMT (4403kb)

Link back to: arXiv, form interface, contact.