About DSHydro and the PI
Nicoleta Cristea is a research hydrologist in the University of Washington (UW) Department of
Civil and Environmental Engineering and a Data Science Fellow at
the UW eScience Institute, Seattle. She is interested in multiple
aspects of water resources research with a focus on climate change effects on water resources, modeling and
remote sensing. At the UW eScience Institute, Nicoleta is co-hosting the
UW Data Science Seminar, a more than a
decade old seminar series.
Research
Our research focuses on understanding large and fine scale hydrologic phenomena in a changing climate through
combining field and remotely sensed observations with physics-based modeling and data driven discovery. We are particularly
interested in methods’ transferability from smaller to larger scales or to different geographic settings.
In our research, we use tools involving geographic information systems, airborne and satellite-derived imagery,
spatial statistics and spatial data exploration, spatiotemporal visualization techniques, machine learning
and physics-based modeling. Current research projects include mapping detailed snow covered
areas from Cubesat imagery using machine learning techniques and snowpack modeling across complex terrain.
Data Science
Machine learning applications in water resources and other geoscience fields are increasingly used to advance data
driven discovery. To facilitate faster adoption of these techniques we
are developing GeoSMART, an educational
framework designed to provide online resources and hands-on learning through
community building and hackweek events.
Cyberinfrastructure for Intelligent High-Resolution Snow Cover Inference from Cubesat Imagery
Funding: NSF Geoinformatics
The ability to observe the Earth from space at relevant spatial and temporal scales is key to understanding how
hydrological and ecological systems will respond to climate change. In particular, high spatial and temporal
resolution (meter scale, daily frequency) observations of snow-covered areas in mountain regions are critical as
snow is important for water resources, driving the seasonal hydrological regimes of the Western U.S., with
significant impacts on ecological communities. Planet Labs, Inc. (Planet) is a promising new source of commercial
Cubesat high-resolution imagery that can be used in environmental science, as it has both high spatial (3.0-4.0 m)
and temporal (1-2 day) resolution. This project will develop open-source, cloud-based cyberinfrastructure
including an automated pipeline for processing, analyzing and interpreting Planet Cubesat image data using a
machine learning approach to infer snow cover at meter-scale resolution. All models and data products will be
openly available for use and modification by scientific communities. The project will support the training of
students, postdocs and other early-career researchers through training events, special interest groups, and
incubator programs.
Currently, remotely-sensed snow observations with adequate temporal (daily) resolution are either captured at a
spatial scale far too large to be relevant to high-resolution hydrology and ecology studies (e.g. MODIS, 500m) or
are appropriate in spatial scale (1-10 m) but have inadequate temporal resolution and are cost-prohibitive (e.g.
airborne LiDAR). The recent increase of commercial Earth Observation data with high spatiotemporal resolution may
bridge the gap between ground-based and low-resolution satellite observation data. This project will focus on
using convolutional neural networks-based models to couple ground and airborne-derived snow observations with
Planet imagery in three different montane systems in Washington, California, and Colorado. These sites have very
good coverage of ground and airborne snow observations at high resolution (3m) collected by the NASA Airborne Snow
Observatory (ASO) and SnowEx missions, which will be used in the training and validation of the models. The
project will develop advanced cyberinfrastructure using scalable virtual machines, distributed collaborative
architecture, reusable computational frameworks, and replicable machine learning workflows to empower Earth
scientists to access, process and generate high-resolution snow products from Cubesat data. The project will adopt
open-source strategies and ensure that all data, algorithms, and architecture comply with FAIR data principles and
reproducibility and will include training materials that promote the adoption of the infrastructure and tools.
Read
more...
Machine Learning Training and Curriculum Development for Earth Science Studies
Funding: NSF Cybetraining
Earth system science discoveries are increasingly affected by the management, analysis, and data inference using
powerful machine learning (ML) techniques. Yet, the skills required to perform these tasks, and the education of
cutting-edge, open-source technology to build ML models and pipelines, big data, and cloud computing, are not
covered by the traditional graduate curriculum in the geosciences. To fill these gaps, this project will develop
the GeoScience MAchine Learning Resources and Training (GeoSMART) framework that will build a foundation in
open-source scientific ecosystems and general ML theory, toolkits, and deployment on Cloud computing.
This project will include a team of geoscience and ML educators to create a novel ML curriculum with focus on
seismology, cryosphere and hydrology applications. The training materials will be included in an enhanced
curriculum that will broaden impact on emerging ML communities. The project's implementation plan will provide
training in open-source ML toolkits and data science skills. Further, the project will cultivate the development
of discipline-specific ML libraries, workflows, and communities of practice to sustain future growth of ML
cybertraining opportunities. By building tools using open-source and cloud-accessible platforms, and by partnering
with colleges and institutions that lack computing resources for ML workflows, the project will increase access to
cybertraining materials and help to solve geoscience challenges.
Evaluation of remotely sensed snow covered area datasets across the Sierra Nevada meadows
Funding: NASA
We will generate high-resolution SCA maps across the Sierra Nevada watersheds and will examine the snow mapping
performance across and around the meadow areas at several times during the ablation season. Meadow ecosystems of
the Sierra Nevada are dependent on the snowpack, groundwater and other hydrologic functions that ensure proper
habitat for diverse biota, providing key ecosystem services in a fragile equilibrium, and at risk from human and
hydroclimatic changes. We will use the digitized meadow dataset to identify meadows with an area greater than 1
acre (~4000 m2) and will analyze the model performance for areas with at least 3x3 MODIS pixels. We will compare
the reconstructed fSCA from the Planet data with MODSCAG fSCA and other forest-corrected fSCA products to diagnose
differences in snow data from the two different satellite image sources with similar temporal resolution (~daily)
but very different spatial resolutions.
What's in a pixel? Snow water equivalent and subpixel variability at multiple spatial resolutions in mountainous
terrain
Funding: NASA
Snowpack melt in spring and summer is critical to meet water demands in the Western United States. Reliable
streamflow forecasting for water management requires robust estimation of the water stored in the snowpack and
applications of distributed hydrologic models. Recent applications in acquisition and processing of light
detection and ranging (lidar) observations to estimate snow depth made possible to evaluate how a hydrologic model
represents the spatial distribution of snow. Multiple studies have identified causes for model uncertainty across
a range of models of various complexity. Snowmelt models are uncertain in their predictions primarily due to:
inadequate or erroneous forcing data, model structure and process representation, as well as coarse spatial
resolution and representation of subgrid variability. We propose to apply different resolution distributed
hydrologic models across several mountainous regions where the lidar-derived snow depth datasets are available. We
will use forcing datasets from multiple sources to run different types of models at variable spatial resolutions
ranging 30-1000 m. We will examine models' performances to simulate snow water equivalent at different spatial
scales, paying particular attention to models' abilities to represent subgrid variability, spatial patterns of
snow in complex terrain and snow in forested areas.
Snow in complex terrain is highly variable, so distributed modeling is essential for proper representations of
snow and streamflow. Remote-sensing (using both space-borne and air-borne instruments) provides critical spatial
measurements, but to definitively identify the “best” model configuration, multiple measurements are needed at
different times of year and at different spatial scales. We combine hydrologic modeling with remotely sensed
distributed snow depth and land surface temperature data to provide diagnostic in model ability to represent
distributed snow processes.
DSHydro News & Updates
Article Title
00/00/00
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore
magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo
consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla
pariatur. Excepteur sint occaecat cupidatat non proident, sunt in culpa qui officia deserunt mollit anim id
est
laborum.
Article Title
00/00/00
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore
magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo
consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla
pariatur. Excepteur sint occaecat cupidatat non proident, sunt in culpa qui officia deserunt mollit anim id est
laborum.
← Back
Full Article
Recent Publications
Cannistra, A.F., Shean, D.E. and Cristea, N.C., 2021. High-resolution CubeSat imagery and machine learning for
detailed snow-covered area. Remote Sensing of Environment, 258, p.112399.
Breckheimer, I.K., Theobald, E.J., Cristea, N.C., Wilson, A.K., Lundquist, J.D., Rochefort, R.M. and
HilleRisLambers, J., 2020. Crowd-sourced data reveal social-ecological mismatches in phenology driven by climate.
Frontiers in Ecology and the Environment, 18(2), pp.76-82.
Lundquist, J.D., Chickadel, C., Cristea, N., Currier, W.R., Henn, B., Keenan, E. and Dozier, J., 2018. Separating
snow and forest temperatures with thermal infrared remote sensing. Remote Sensing of Environment, 209,
pp.764-779.
Cristea, N.C., Breckheimer, I., Raleigh, M.S., HilleRisLambers, J. and Lundquist, J.D., 2017. An evaluation of
terrain-based downscaling of fractional snow covered area data sets based on LiDAR-derived snow data and
orthoimagery. Water Resources Research, 53(8), pp.6802-6820.
DSHydro Team
Who we are and why we do it!
Nicoleta is a research hydrologist interested in combining computer modeling of freshwater systems with remote
sensing and spatial and temporal analyses. Like in many fields, efficiently storing, analyzing, and
visualizing the increasingly large datasets are becoming crucial for conducting successful research. We are
working on including modern data science techniques in teaching and research activities. Nicoleta holds a PhD
degree from the University of Washington. In her spare time she enjoys spending time with her family,
traveling, and skiing.
Kehan Yang is a Data Science Postdoctoral Fellow working with Dr. Nicoleta Cristea and Dr. Jessica Lundquist
in the Department of Civil and Environmental Engineering and eScience Institute at the University of
Washington. She earned her Ph.D. in Geography at University of Colorado Boulder, advised by Dr. Noah Molotch.
Dr. Yang's research is focused on advancing our understanding of spatiotemporal variability of seasonal
snowpack and snow water resources using multi-platform remote sensing observations and statistical learning
models. Her current project focuses on developing a cloud-based cyberinfrastructure to derive high
spatiotemporal resolution (i.e., ~ daily, ~3 m) snow cover information from commercial CubeSat imagery with
particular interest in vulnerable mountain ecosystems.
Aji John is a recent University of Washington (UW) Biology graduate working with Dr. Josh Lawler (UW SEFS)
and Dr. Nicoleta Cristea (UW Civil Engg.). Aji is actively researching use of fine-spatial satellite imagery
to investigate flowering patterns of montane species, and looking at distribution of snow in montane areas (a
key driver of productivity). He is also affiliated with eScience Institute at UW as a Postdoctoral fellow.
Stefan Todoran is University of Washington Computer Science student working primarily with Dr. Nicoleta
Cristea and Dr. Marine Denolle as part of the GeoSMART team. Stefan is currently building infrastructure
for the advancement of ML tools within the geosciences, ranging from the GeoSMART and DSHydro websites and
blogs to CI/CD automation for GeoSMART curricula. In addition, Stefan is currently a part of multiple research
projects, including the use of computer vision in the reconstruction of misaligned image datasets, as well as
the use of aerial photography for cold water refuge mapping. In his free time he enjoys arts & crafts, skiing,
dancing and traveling!