Johannes Gehrke
Affiliation
Cornell
University
Research Interest
My research spans both data mining and
distributed query processing. In data
mining, we are building a distributed data-mining infrastructure
that
permits continuous monitoring of remote data sources, and we are working
on improving the performance of existing data mining tools towards
interactivity. My research in distributed query processing spans data
management for sensor networks, processing queries over data streams,
and peer-to-peer systems.
Foresight
The widespread distribution and
availability of small-scale sensors, actuators,
and embedded processors is transforming the physical world into
a computing platform. Sensor networks consisting of a large number of
sensor nodes that combine physical sensing capabilities with networking
and computation capabilities will be deployed widely in the near
future. We need a data management infrastructure to respond to the data
tsunami, a system that scales with the growth of interconnectivity and
computational power over the next decades, and provides scalable, fault-tolerant,
flexible data access and intelligent data reduction, and its
design will require novel research in database systems, data mining,
networking,
and fault-tolerance.