

Object detection is a pervasive area of research in Computer Vision. Its applications are vast and include medical pathology (detection of malignant cells in histology slides), brain imaging (identification of structural signs of brain disease), detection of objects in remotely sensed imagery (traffic analysis and target detection), face detection for digital camera technology, and pedestrian tracking for surveillance. A broad spectrum of object detection approaches have been proposed in the literature, often using the popular boosting algorithm, AdaBoost. I propose a new object detection system that employs a new learning framework for combining ensembles of weak detectors into a single detector.
Prior work:

Damian Eads
UCSC Student

Ethan Miller
UCSC Instructor

Hai Tao
UCSC Instructor

David Helmbold
UCSC Instructor

James Theiler
Mentor

Simon Perkins
Mentor

Edward Rosten
Mentor

Andrew Fraser
Mentor

RAID systems have traditionally offered increased performance and data security in small storage systems. An opportunity now exists to extend traditional RAID principles into the area of large-scale object-based storage devices in order to offer greater data security and space efficiency. In a system where component failures can be expected on a daily basis, the importance of redundancy mechanisms is obvious, and RAID principles offer an appropriate model. Ceph is an excellent platform to test these RAID systems and to learn how they function in a new environment.

David Bigelow
UCSC Student

Scott Brandt
UCSC Instructor
Gary Grider
Mentor

James Nunez
Mentor

John Bent
Mentor

HB Chen
Mentor

The ISIS team in ISR-2 primarily uses supervised learning techniques to solve classification problems in imagery and therefore has a strong interest in finding linear classification algorithms that are both robust and efficient. Boosting algorithms take a principled approach to finding linear classifiers and they has been shown to be so effective in practice that they are widely used in a variety of domains. In this proposal we present evidence that smoothing is not necessarily the optimal way

Karen Glocer
UCSC Student

Manfred Warmuth
UCSC Instructor

James Theiler
Mentor

Simon Perkins
Mentor

Tuning is one of the most challenging and important tasks in setting up a database system. A typical assumption is that a representative workload can provide the means to perform some initial tuning automatically, which can then be refined manually by the database administrator. However, several emerging application domains challenge this assumption. A characteristic example is scientific data
management, where the use of ad-hoc data analytics cancels the ability to gather a representative workload, and the lack of database expertise among scientists lowers the possibility for efficient manual tuning. This motivates us to look into autonomic tuning techniques that operate continuously and require minimal intervention from an administrator. In particular, we examine online methods to tune the physical database design, i.e., the set of materialized data structures, such as indices and materialized views, that are crucial for efficient query processing.

Karl Schnaitter
UCSC Student

Neoklis Polyzotis
UCSC Instructor
Gary Grider
Mentor

James Nunez
Mentor

John Bent
Mentor

We propose to work on the problem of calibrating the parameters of computer code used for simulation of physical phenomena. We will explore statistical methods based on a Bayesian approach implemented with Sampling Importance Resampling (SIR).

Tracy Holsclaw
UCSC Student

Herbie Lee
UCSC Instructor

Bruno Sanso
UCSC Instructor

David Higdon
Mentor

Katrin Heitmann
Mentor

Salman Habib
Mentor

Pseudorandom placement in distributed storage systems offers scalability benefits. Pseudorandom placement makes load balancing harder; new techniques are required. We explore different load balancing techniques using Ceph, an object-based storage system developed at UCSC.
Esteban Molina-Estolano
UCSC Student

Scott Brandt
UCSC Instructor
Gary Grider
Mentor

James Nunez
Mentor

John Bent
Mentor

Large data centers are typically composed of many hard drives arranged in a striped layout with redundancy, to provide adequate performance and reliability. Solid state disk drives (SSDs) are a newer technology that has better performance and lower power requirements compared to hard drives. This research examines the use of solid state disk drives in disk-SSD hybrid storage systems.

Rosie Wacha
UCSC Student

Scott Brandt
UCSC Instructor
Gary Grider
Mentor

James Nunez
Mentor

John Bent
Mentor

Definition: Mutivalue data are where you have multiple values about the same variable in repeated measurements.
Challenge: Lack of current comprehensive visualization tools for the probabilistic or uncertain nature of the multivalue data.
Research: Investigate probabilistic streamlines using different ways of combining vector distributions.
The size of the data sets and the uncertainty in the data sets come from the fact that we are dealing with ensemble data sets. These are usually from Monte Carlo simulations where each output (out of many runs) represents a possible solution. The degree of agreement (or disagreement) provides some indications of certainty (or uncertainty) about the results. Because Monte Carlo simulations can potentially involve large number of repetitions, the total data size can very quickly get very large. This project will explore uncertainty and how visualization can be used as a tool to help deal with it.

Eddy Chandra
UCSC Student

Alex Pang
UCSC Instructor

Katrin Heitmann
Mentor

James Ahrens
Mentor

To help users make sense of collaboratively-generated information, we are developing algorithmic notions of information trust

Ian Pye
UCSC Student

Bo Adler
UCSC Student

Luca de Alfaro
UCSC Instructor

Shelly Spearing
Mentor

Jorge Roman
Mentor
We focus on developing intelligent interactive search and browsing techniques to help users and the information they are looking for from billions of non-relevant files

Jessica Gronski
UCSC Student

Yi Zhang
UCSC Instructor

Herbert Van de Sompel
Mentor

In large tightly coupled parallel systems, computation goes as fast as the slowest part. For this reason it is necessary to pursue deterministic behavior of all parts of the system. Quality of Service is one way to assist in providing deterministic behavior. This project will explore providing Quality of Service on networks of interest to high performance computing.
Andrew Shewmaker
UCSC Student

Scott Brandt
UCSC Instructor
Andrew Shewmaker
Mentor

The research objective of this proposal is to measure human body shape and motion without augmenting the subject. The hypothesis is that replacing traditional cameras with high accuracy 3D shape measurement devices and utilizing a carefully constructed prior model of human surface shape are the critical factors that have been missing from prior attempts to meet this goal. The long term accuracy targets are shape to 1mm and motion to 1deg.

Steve Scher
UCSC Student

James Davis
UCSC Instructor

Sriram Swaminarayan
Mentor
We are planning to develop a web-based system to help 'decision makers' quickly identify and process relevant web-based information in case of a disease outbreak. We will work on identifying the pathogen based on sequence information. We will also develop an adaptive information filtering to find, filter and condense the information available on the web.

Yi Zhang
UCSC Instructor

Carla Kuiken
Mentor