Towards a High Productivity Automatic Analysis Framework for Classification: An Initial Study

Abstract

Due to the recent explosion of research data generated by novel scientific instruments and the corresponding experiments, automatic features, in particular in data analysis, have become more essential than ever. In this paper we present a new Automatic Analysis Framework (AAF) that increases the productivity of data analysis. The AAF can be used for classification, prediction, and clustering. It is built upon the workflow engine Taverna, which is widely used in different domains and for which a large number of activities exist for various kinds of analytical methods. The AAF enables scientists to modify our predefined Taverna workflow and to extend it with other available activities. For the execution of the analytical methods, in particular for the computation of the results, we use our own cloud-based Code Execution Framework (CEF). It provides web services to execute problem solving environment code, such as MATLAB, Octave, and R scripts, in parallel in the cloud. The combination of the AAF and CEF enables scientists to easily conduct time-consuming calculations without having to manually enumerate the possible combinations of independent variables. It furthermore automatically evaluates all identified models and provides a service for the scientists conducting the analysis. The framework has been tested and evaluated with real breath gas data.

Authors
  • Ludescher, Thomas
  • Feilhauer, Thomas
  • Amann, Anton
  • Brezany, Peter
Editors
  • Perner, P.
Shortfacts
Category
Paper in Conference Proceedings or in Workshop Proceedings
Event Title
Industrial Conference on Data Mining (ICDM) 2013, New York, U.S.A., July 21-26, 2013
Divisions
Scientific Computing
Event Location
New York, USA
Event Type
Conference
Event Dates
21-26 July 2013
Series Name
7987
Page Range
pp. 25-39
Date
December 2013