|
|
|
|
|
Overview
My research interests center on large scale structure and cosmology,
specifically the intersection of data, measurement and theory. As current
and future surveys like the SDSS, LSST and PanSTARS increase our knowledge
base, exploiting the full potential of these resources will require the ability
to integrate these three areas in a cohesive manner. Only then will we be
able to address the fundamental physics behind processes like the formation
of galaxies and the behavior of dark energy.
|
|
|
|
|
Introduction
|
The next decade will witness an enormous increase in information available
for large scale structure cosmology. Deep, wide area surveys with
multi-band photometry will generate catalogs with hundreds of millions of
galaxies and thousands of terabytes of data. In principle, this will allow
us to measure quantities like the abundance of dark matter with exquisite
precision as well as give us the first strong limits on the behavior and
history of dark energy. Likewise, the integration of depth, luminosity,
color and time domain information will provide a much richer picture of
complicated, smaller scale processes like galaxy formation and evolution.
Extracting the full benefit from this new wealth of information will require
both an understanding of the scope of the data (and its limitations) and a
set of tools capable of handling its volume. The following few pages will
briefly outline my efforts along these lines over the past few years using
data from the SDSS and other current surveys.
|
|
|
|
Building the Data Set
|
My main project for the last several years has centered on analysis of
the photometric data from the SDSS and the enrichment of this data
set with additional parameters. The initial stage in this process was the
rigorous testing of the early SDSS reductions for systematic
errors in the photometric pipeline, both external (bad
seeing, excessive reddening by dust, etc.) and internal (variations in CCD
sensitivity, errors in deblending nearby objects, etc.). More recently, I have
been heading up the assembly of the photometric data set for the fifth
official SDSS data release (DR5), containing roughly 100 million
galaxies across an area of 7500 square degrees. We are augmenting the
standard photometric parameters with an improved version of a Bayesian
star/galaxy separation algorithm, photometric redshifts, and
a kernel-density based photometric QSO selection which increases
the number of QSOs by a factor of 10 over the SDSS spectroscopic survey at
better than 95% efficiency. The effects we want to measure are subtle, so
careful construction of such a catalog and controlling sources of systematic
error is vital.
|
|
|
|
|
Extracting the Information
|
This photometric catalog will provide a homogeneous, well-characterized data
set for various angular correlation projects. Initial SDSS measurements were
limited to the galaxy auto-correlation function and angular power
spectrum (measurements we are currently revisiting with our more
extensive data set), but this data set will let us go much further. We can
take advantage of the photometric redshifts to construct volume limited
samples at a much greater depth than is possible with the SDSS spectroscopic
survey, extending previous measurements in both area and
angular range. This allows us to probe much smaller physical scales than
current wide-area spectroscopic surveys, offering a greater insight into
the distribution of galaxies of different types and luminosities
throughout dark matter halos.
Over the past two years, we have also used this data set along with maps
from the Wilkinson Microwave Anisotropy Probe (WMAP) to
measure the induced cross-correlation between the foreground large scale
structure and background cosmic microwave background radiation.
The large-angle component of this cross-correlation comes as a result of the
ISW effect -- the late decay of gravitational potentials due to
accelerated cosmological expansion. For a flat universe, this serves as a
direct probe of the existence of dark energy. Taking advantage of the galaxy
color information available with the SDSS, we were able to select a
sub-population of galaxies (luminous red galaxies) from the
SDSS data and further subdivide them into four deep redshift slices. By
statistically combining the cross-correlations of these slices with the CMB
maps from WMAP (a single cross-correlation is shown in
Figure 1), we found a statistically significant detection
of the induced cross-correlation, a clear signature of dark energy.

Figure 1: Galaxy-CMB cross-correlation for luminous red galaxies
selected from the SDSS and temperature fluctuations from WMAP. The line
shows the expected signal ISW signal for a linear galaxy bias of 3.
During the last year, we have combined the photometric galaxy and quasar
catalogs to measure the cosmic magnification of the background quasars by
foreground large scale structure. Weak gravitational lensing
by the foreground matter induces a small correlation between the two distinct
populations, making the background quasars brighter but diluting their surface
density on the sky. A number of previous attempts have been made at this
measurement but they routinely disagreed with both the predicted signal and
each other. With our superior control over
sources of systematic errors in both the galaxy and quasar data sets as well
as a much higher density of quasars on the sky, we were able to detect the
expected lensing signal at the 8 sigma level. Thanks to the broad dynamic
range of the photometric data, we were also able to measure the expected
angular variation of the signal on physical scales ranging from
60 kpc/h to 10 Mpc/h (Figure 2).
Demonstrating that the expected magnification signal can be reliably detected
opens up a parallel track for all future weak lensing surveys, complementing
shear measurements and providing a crucial check on systematic biases.
Figure 2: Galaxy-QSO cosmic magnification for z > 1 photometric
quasars and z < 0.6 galaxies. The dashed red line is the expected
signal from standard theory and the solid black line indicates the best fit
to the data.
To make measurements over such a wide, densely filled area feasible, we have
developed
SDSSPix:
a hierarchical, equal-area pixelization of the sphere built around the SDSS
geometry. By integrating this method into our angular correlation function
algorithms, I have been able to reduce the typical running time by upwards of
three orders of magnitude. Measurements on our current data set actually take
far less processing time than our previous results, despite a factor of
40 increase in area. Without such an improvement, determining the statistical
errors for our current measurements (to say nothing of measuring higher
order moments) would be effectively impossible.
|
|
|
|
|
Understanding the Results
|
In addition to my work extracting statistical information from current data,
my other primary interest has been a phenomenological study of how current and
future surveys will inform various aspects of our current theories of galaxy
clustering and dark energy. On the galaxy clustering side, most of my work
has centered on the application and extension of semi-analytic halo
models to reflect the richness of future data sets. While
current theory has been reasonably successful in addressing classic large
scale structure measurements, current and future
surveys offer more information per object than the simple position on the
sky.
Multi-band imaging surveys give us the power to divide galaxies on
the basis of color, morphology, luminosity, and the like. With a few modest
extensions to the standard formalism, halo model theories
can make predictions for these sub-populations while maintaining the character
of the total population. In this extension, we
ultimately want to describe the population of dark matter halos by galaxies
-- the halo occupation density (HOD) -- not just as a function of halo mass,
but also galaxy type, luminosity and so on. My thesis applied this new
formalism to the expected angular clustering in the final SDSS data set,
finding that, in the simple case considering a volume-limited sample split
into red and blue sub-populations, one could expect to constrain the
parameters describing the HOD to better than 10%.
In addition to galaxy clustering, one can also extract information about
the HOD using magnification bias -- an induced cross-correlation
between background and foreground objects due to weak lensing. This effect is
a function of the number of galaxies in a halo on small scales,
while the auto-correlation is a function of the number of galaxy pairs
within a halo on those scales. As such, the combination should
give us an excellent description of the first two moments of the HOD, given
the expected errors from the completed SDSS photometric survey.
On the dark energy front, current and future meaurements of the ISW will
likely provide relatively weak constraints on standard cosmological
parameters. However, as we show in our paper, combining
the ISW, galaxy auto-correlation, and weak lensing results from a full-sky
deep galaxy survey may be singularly useful in other ways. In particular,
measuring the scale for dark energy clustering (if one exists) is extremely
difficult with most cosmological statistics. Despite the relatively
poor statistical significance of the ISW signal for an optimal survey
(S/N ~ 10), the nature of the effect is such that one could
constrain the clustering of dark energy to 3% precision at Gpc scales.
|
|
|
|
|
Future Synergy
|
While current surveys provide invaluable data sources, the future of large
scale structure science will be powered by data conglomerations like the
National Virtual Observatory. The size of future datasets demands moving
from simple collections of flat files to structured, indexed databases,
taking advantage of years of computer science research in database design.
This migration also makes compiling information on objects from different
surveys across the electromagnetic spectrum possible on a massive scale.
In addition to merely serving raw data, these virtual observatories will also
act as repositories for the means to translate that data into the statistical
measures that will constrain our future models of galaxy formation and
evolution. Our group in Pittsburgh has taken a leading role in this effort,
forming the GESTALT collaboration (Figure 3; GESTALT E-Science
Telescope: Area, Location \& Time) to develop tools for data exploration as
well as data analysis, calibration and integration. Our current projects
include an integrated method for applying the fast angular correlation codes
described earlier directly to photometric data from databases serving the
2MASS, FIRST, and SDSS data. This service is completely automated, taking a
database query, generating all of the appropriate footprint and masking
information and returning a correlation function. Unlike the current state
of cosmology, where data selection and analysis remains largely a
``black-box'' affair, this service will be completely transparent, with open
source code and easy replication by any later investigator. More services
like this are on the way and they will serve as the backbone of cosmology
for decades to come.
Figure 3: The footprints for the 2MASS, FIRST and SDSS surveys as
well as the contents of the NOAO Science Archive. This map was generated by
pixelizing each of these surveys with SDSSPix and interfacing with the GESTALT
footprint server.
|
|
|
|
|
Teaching & Outreach
|
My time as a graduate student included two years as a teaching assistant,
running lab sessions for an undergraduate astronomy survey course. For the
majority of the time, this consisted of running pre-planned lab experiments.
However, a third of each year was given over to a quarter-long project of
my own design. Using data from the Supernova Cosmology Project, students
looked for supernovae, performing simple image subtraction with educational
software. At the end of the term, some of the more enterprising students were
able to succussfully fit a simple Hubble diagram. In my last year in
graduate school, I spent a quarter leading a similar supplemental session for
advanced astrophysics undergraduates. This time, I guided the students through
the basics of cosmology statistics. By the end of the course, they were able
to translate a simple angular correlation estimator into compiled C code and
generate a measurement of the angular correlation function from SDSS data.
As a post-doctoral researcher, my time spent in classrooms has been largely
limited to occasionally filling in lectures for out of town faculty members.
However, the notariety from the ISW measurement led to a number of public
lectures in and around Pittsburgh (see here for
a full list). The audiences for these talks varied from high school and
college students to amateur astronomers and the general public. I am
particularly interested in increasing communication between scientists and
the rest of the community, so being able to give these talks and having them
well received was especially satisfying for me.
|
|
|