Home | Search | Help  
Home Page Università di Genova

Seminar Details

Date 17-3-2010
Time 16:30
Room/Location DISI-Sala Conferenze 3 piano
Title Multimedia Data Processing: Efficiency, Scalability, and Effectiveness.
Speaker K. Selcuk Candan, Professor
Affiliation Computer Science and Engineering School of Computing-IDSE (CIDSE) Arizona State University
Link http://www.public.asu.edu/~candan/index.htm
Abstract Abstract: Media analysis and mining involve processing of large quantities of real-time and/or stored data and measurements. Data (whether captured in real-time through sensory measurements or processed, materialized, and stored for later use) are many times accurate only within a margin of error. Moreover, in many applications, the utility of a data element to a particular analysis task depends on the usage context. The fundamental principles that govern the next generation of media/data analysis middleware must include data and operator imprecision, relevance of data to a particular analysis task, and the interest and expertise of the knowledge consumer. In this talk, I will discuss challenges for and opportunities in developing efficient and effective analysis middleware to support large scale data processing and decision making applications, where the data elements, metadata, and the operations on the data may be imprecise. As a specific example, I will introduce and describe RankMR for processing ranked queries in batch-oriented cluster environments. In particular, I describe how RankMR adaptively samples data from "upstream" operators to help allocate resources in a work balanced and wasted-work avoiding manner for top-k join processing. Experimental results show that the proposed sampling, data partitioning, and join processing strategies enable RankMR to return utlity ranked results with high confidence and low-overhead.
Back to Seminars