Transformations of large data sets

Problem

Though data analysis tools continue to improve, analysts still expend an inordinate amount of time and effort manipulating data and assessing data quality issues. Such “data wrangling” regularly involves reformatting data values or layout, correcting erroneous or missing values, and integrating multiple data sources. These transformations are often difficult to specify and difficult to reuse across analysis tasks, teams, and tools.

Aim

Developing a software prototype that facilitates the specification and execution of transformations of large data tables.

Other information

Starting point(s) for research (contact person listed below for details):

Wrangler: Interactive Visual Specification of Data Transformation Scripts

Contact

Further information

Area
Visual Analytics (VA)
Not specified
Scope
BA
PR
MA
Assigned as
Master thesis/Diplomarbeit
Status
in progress