Package 'ncdfgeom' reference manual

Title:	'NetCDF' Geometry and Time Series
Description:	Tools to create time series and geometry 'NetCDF' files.
Authors:	David Blodgett [aut, cre], Luke Winslow [ctb]
Maintainer:	David Blodgett <[email protected]>
License:	CC0
Version:	1.3.0
Built:	2025-03-19 05:34:01 UTC
Source:	https://github.com/doi-usgs/ncdfgeom

Area Weighted Intersection

Description

Returns the fractional percent of each feature in x that is covered by each intersecting feature in y. These can be used as the weights in an area-weighted mean overlay analysis where x is the data **source** and area- weighted means are being generated for the **target**, y.

This function is a lightwieght wrapper around the functions aw_intersect aw_total and aw_weight from the areal package.

Usage

calculate_area_intersection_weights(x, y, normalize, allow_lonlat = FALSE)
calculate_area_intersection_weights(x, y, normalize, allow_lonlat = FALSE)

Arguments

`x`	sf data.frame source features including one geometry column and one identifier column
`y`	sf data.frame target features including one geometry column and one identifier column
`normalize`	logical return normalized weights or not. Normalized weights express the fraction of target polygons covered by a portion of each source polygon. They are normalized in that the area of each source polygon has already been factored into the weight. Un-normalized weights express the fraction of source polygons covered by a portion of each target polygon. This is a more general form that requires knowledge of the area of each source polygon to derive area-weighted statistics from source to **target. See details and examples for more regarding this distinction.
`allow_lonlat`	boolean If FALSE (the default) lon/lat target features are not allowed. Intersections in lon/lat are generally not valid and problematic at the international date line.

Details

Two versions of weights are available:

'normalize = FALSE', if a polygon from x (source) is entirely within a polygon in y (target), w will be 1. If a polygon from x (source) is 50 and 50 in each. Weights will sum to 1 per **SOURCE** polygon if the target polygons fully cover that feature.

For 'normalize = FALSE' the area weighted mean calculation must include the area of each x (source) polygon as in:

> *in this case, 'area' is the area of source polygons and you would do this operation grouped by target polygon id.*

> 'sum( (val * w * area), na.rm = TRUE ) / sum(w * area)'

If 'normalize = TRUE', weights are divided by the target polygon area such that weights sum to 1 per TARGET polygon if the target polygon is fully covered by source polygons.

For 'normalize = FALSE' the area weighted mean calculation no area is required as in:

> 'sum( (val * w), na.rm = TRUE ) / sum(w)'

See examples for illustration of these two modes.

Value

data.frame containing fraction of each feature in x that is covered by each feature in y.

Examples


library(sf)

source <- st_sf(source_id = c(1, 2), 
                val = c(10, 20), 
                geom = st_as_sfc(c(
  "POLYGON ((0.2 1.2, 1.8 1.2, 1.8 2.8, 0.2 2.8, 0.2 1.2))", 
  "POLYGON ((-1.96 1.04, -0.04 1.04, -0.04 2.96, -1.96 2.96, -1.96 1.04))")))

source$area <- as.numeric(st_area(source))

target <- st_sf(target_id = "a", 
                geom = st_as_sfc("POLYGON ((-1.2 1, 0.8 1, 0.8 3, -1.2 3, -1.2 1))"))

plot(source['val'], reset = FALSE)
plot(st_geometry(target), add = TRUE)

(w <- 
calculate_area_intersection_weights(source[c("source_id", "geom")], 
                                    target[c("target_id", "geom")], 
                                    normalize = FALSE, allow_lonlat = TRUE))

(res <-
merge(st_drop_geometry(source), w, by = "source_id"))

sum(res$val * res$w * res$area) / sum(res$w * res$area)

(w <-
calculate_area_intersection_weights(source[c("source_id", "geom")], 
                                    target[c("target_id", "geom")], 
                                    normalize = TRUE, allow_lonlat = TRUE))
(res <-
merge(st_drop_geometry(source), w, by = "source_id"))

sum(res$val * res$w) / sum(res$w)

library(sf)

source <- st_sf(source_id = c(1, 2), 
                val = c(10, 20), 
                geom = st_as_sfc(c(
  "POLYGON ((0.2 1.2, 1.8 1.2, 1.8 2.8, 0.2 2.8, 0.2 1.2))", 
  "POLYGON ((-1.96 1.04, -0.04 1.04, -0.04 2.96, -1.96 2.96, -1.96 1.04))")))

source$area <- as.numeric(st_area(source))

target <- st_sf(target_id = "a", 
                geom = st_as_sfc("POLYGON ((-1.2 1, 0.8 1, 0.8 3, -1.2 3, -1.2 1))"))

plot(source['val'], reset = FALSE)
plot(st_geometry(target), add = TRUE)

(w <- 
calculate_area_intersection_weights(source[c("source_id", "geom")], 
                                    target[c("target_id", "geom")], 
                                    normalize = FALSE, allow_lonlat = TRUE))

(res <-
merge(st_drop_geometry(source), w, by = "source_id"))

sum(res$val * res$w * res$area) / sum(res$w * res$area)

(w <-
calculate_area_intersection_weights(source[c("source_id", "geom")], 
                                    target[c("target_id", "geom")], 
                                    normalize = TRUE, allow_lonlat = TRUE))
(res <-
merge(st_drop_geometry(source), w, by = "source_id"))

sum(res$val * res$w) / sum(res$w)

Create Cell Geometry

Description

Creates cell geometry from vectors of X and Y positions.

Usage

create_cell_geometry(
  X_coords,
  Y_coords,
  prj,
  geom = NULL,
  buffer_dist = 0,
  regularize = FALSE,
  eps = 1e-10
)
create_cell_geometry(
  X_coords,
  Y_coords,
  prj,
  geom = NULL,
  buffer_dist = 0,
  regularize = FALSE,
  eps = 1e-10
)

Arguments

`X_coords`	numeric center positions of X axis indices
`Y_coords`	numeric center positions of Y axis indices
`prj`	character proj4 string for x and y
`geom`	sf data.frame with geometry that cell geometry should cover
`buffer_dist`	numeric a distance to buffer the cell geometry in units of geom projection
`regularize`	boolean if TRUE, grid spacing will be adjusted to be exactly equal. Only applies to 1-d coordinates.
`eps`	numeric sets tolerance for grid regularity.

Details

Intersection is performed with cell centers then geometry is constructed. A buffer may be required to fully cover geometry with cells.

Examples

dir <- tempdir()
ncf <- file.path(dir, "metdata.nc") 

try(zip::unzip(system.file("extdata/metdata.zip", package = "ncdfgeom"), exdir = dir))

if(file.exists(ncf)) {

nc <- RNetCDF::open.nc(ncf)
ncmeta::nc_vars(nc)
variable_name <- "precipitation_amount"
cv <- ncmeta::nc_coord_var(nc, variable_name)

x <- RNetCDF::var.get.nc(nc, cv$X, unpack = TRUE)
y <- RNetCDF::var.get.nc(nc, cv$Y, unpack = TRUE)

prj <- ncmeta::nc_gm_to_prj(ncmeta::nc_grid_mapping_atts(nc))

geom <- sf::read_sf(system.file("shape/nc.shp", package = "sf"))
geom <- sf::st_transform(geom, 5070)

cell_geometry <- create_cell_geometry(x, y, prj, geom, 0)

plot(sf::st_geometry(cell_geometry), lwd = 0.25)
plot(sf::st_transform(sf::st_geometry(geom), prj), add = TRUE)

}

dir <- tempdir()
ncf <- file.path(dir, "metdata.nc") 

try(zip::unzip(system.file("extdata/metdata.zip", package = "ncdfgeom"), exdir = dir))

if(file.exists(ncf)) {

nc <- RNetCDF::open.nc(ncf)
ncmeta::nc_vars(nc)
variable_name <- "precipitation_amount"
cv <- ncmeta::nc_coord_var(nc, variable_name)

x <- RNetCDF::var.get.nc(nc, cv$X, unpack = TRUE)
y <- RNetCDF::var.get.nc(nc, cv$Y, unpack = TRUE)

prj <- ncmeta::nc_gm_to_prj(ncmeta::nc_grid_mapping_atts(nc))

geom <- sf::read_sf(system.file("shape/nc.shp", package = "sf"))
geom <- sf::st_transform(geom, 5070)

cell_geometry <- create_cell_geometry(x, y, prj, geom, 0)

plot(sf::st_geometry(cell_geometry), lwd = 0.25)
plot(sf::st_transform(sf::st_geometry(geom), prj), add = TRUE)

}

Read attribute dataframe from NetCDF-DSG file

Description

Gets attribute data from a NetCDF-DSG file and returns it in a data.frame. This function is intended as a convenience to be used within workflows where the netCDF file is already open and well understood.

Usage

read_attribute_data(nc, instance_dim)
read_attribute_data(nc, instance_dim)

Arguments

`nc`	A NetCDF path or urlto be opened.
`instance_dim`	The NetCDF instance/station dimension.

Examples

hucPolygons <- sf::read_sf(system.file('extdata','example_huc_eta.json', package = 'ncdfgeom'))
hucPolygons_nc <- ncdfgeom::write_geometry(tempfile(), hucPolygons)

read_attribute_data(hucPolygons_nc, "instance")

hucPolygons <- sf::read_sf(system.file('extdata','example_huc_eta.json', package = 'ncdfgeom'))
hucPolygons_nc <- ncdfgeom::write_geometry(tempfile(), hucPolygons)

read_attribute_data(hucPolygons_nc, "instance")

Read NetCDF-CF spatial geometries

Description

Attempts to convert a NetCDF-CF DSG Simple Geometry file into a sf data.frame.

Usage

read_geometry(nc_file)
read_geometry(nc_file)

Arguments

nc_file

character file path to the nc file to be read.

Value

sf data.frame containing spatial geometry of type found in the NetCDF-CF DSG file.

References

http://cfconventions.org/index.html

http://cfconventions.org/cf-conventions/cf-conventions.html#_features_and_feature_types

Examples

huc_eta_nc <- tempfile()
file.copy(system.file('extdata','example_huc_eta.nc', package = 'ncdfgeom'), 
         huc_eta_nc, overwrite = TRUE)
         
vars <- ncmeta::nc_vars(huc_eta_nc)

hucPolygons <- sf::read_sf(system.file('extdata','example_huc_eta.json', package = 'ncdfgeom'))
plot(sf::st_geometry(hucPolygons))
names(hucPolygons)

hucPolygons_nc <- ncdfgeom::write_geometry(nc_file=huc_eta_nc, 
                                          geom_data = hucPolygons, 
                                          instance_dim_name = "station", 
                                          variables = vars$name)
huc_poly <- read_geometry(huc_eta_nc)
plot(sf::st_geometry(huc_poly))
names(huc_poly)

huc_eta_nc <- tempfile()
file.copy(system.file('extdata','example_huc_eta.nc', package = 'ncdfgeom'), 
         huc_eta_nc, overwrite = TRUE)
         
vars <- ncmeta::nc_vars(huc_eta_nc)

hucPolygons <- sf::read_sf(system.file('extdata','example_huc_eta.json', package = 'ncdfgeom'))
plot(sf::st_geometry(hucPolygons))
names(hucPolygons)

hucPolygons_nc <- ncdfgeom::write_geometry(nc_file=huc_eta_nc, 
                                          geom_data = hucPolygons, 
                                          instance_dim_name = "station", 
                                          variables = vars$name)
huc_poly <- read_geometry(huc_eta_nc)
plot(sf::st_geometry(huc_poly))
names(huc_poly)

Read NetCDF-CF timeSeries featuretype

Description

This function reads a timeseries discrete sampling geometry NetCDF file and returns a list containing the file's contents.

Usage

read_timeseries_dsg(nc_file, read_data = TRUE)
read_timeseries_dsg(nc_file, read_data = TRUE)

Arguments

`nc_file`	character file path to the nc file to be read.
`read_data`	logical whether to read metadata only or not.

Details

The current implementation checks several NetCDF-CF specific conventions prior to attempting to read the file. The Conventions and featureType global attributes are checked but not strictly required.

Variables with standard_name and/or cf_role of station_id and/or timeseries_id are searched for to indicate which variable is the 'timeseries identifier'. The function stops if one is not found.

All variables are introspected for a coordinates attribute. This attribute is used to determine which variables are coordinate variables. If none are found an attempt to infer data variables by time and timeseries_id dimensions is made.

The coordinates variables are introspected and their standard_names used to determine which coordinate they are. Lat, lon, and time are required, height is not.

Variables with a coordinates attribute are assumed to be the 'data variables'.

Data variables are traversed and their metadata and data content put into lists within the main response list.

See the timeseries vignette for more information.

Value

list containing the contents of the NetCDF file.

References

https://www.unidata.ucar.edu/software/netcdf-java/v4.6/reference/FeatureDatasets/CFpointImplement.html

Write attribute data to NetCDF-CF

Description

Creates a NetCDF file with an instance dimension, and any attributes from a data frame. Use to create the start of a NetCDF-DSG file. One character length dimension is created long enough to contain the longest provided character string. This function does not implement any CF convention attributes or standard names. Any columns of class date will be converted to character.

Usage

write_attribute_data(
  nc_file,
  att_data,
  instance_dim_name = "instance",
  units = rep("unknown", ncol(att_data)),
  overwrite = FALSE
)
write_attribute_data(
  nc_file,
  att_data,
  instance_dim_name = "instance",
  units = rep("unknown", ncol(att_data)),
  overwrite = FALSE
)

Arguments

`nc_file`	`character` file path to the nc file to be created. If adding to a file, it must already have the named instance dimension.
`att_data`	`data.frame` with instances as columns and attributes as rows.
`instance_dim_name`	`character` name for the instance dimension. Defaults to "instance"
`units`	`character` vector with units for each column of att_data. Defaults to "unknown" for all.
`overwrite`	boolean overwrite existing file? Will append if FALSE.

Examples

sample_data <- sf::st_set_geometry(sf::read_sf(system.file("shape/nc.shp", 
                                                           package = "sf")), 
                                   NULL)
example_file <-write_attribute_data(tempfile(), sample_data,
                                    units = rep("unknown", ncol(sample_data)))

try({
  ncdump <- system(paste("ncdump -h", example_file), intern = TRUE)
  cat(ncdump ,sep = "\n")
}, silent = TRUE)

sample_data <- sf::st_set_geometry(sf::read_sf(system.file("shape/nc.shp", 
                                                           package = "sf")), 
                                   NULL)
example_file <-write_attribute_data(tempfile(), sample_data,
                                    units = rep("unknown", ncol(sample_data)))

try({
  ncdump <- system(paste("ncdump -h", example_file), intern = TRUE)
  cat(ncdump ,sep = "\n")
}, silent = TRUE)

Write geometries and attributes to NetCDF-CF

Description

Creates a file with point, line or polygon instance data ready for the extended NetCDF-CF timeSeries featuretype format.

Will also add attributes if provided data has them.

Usage

write_geometry(
  nc_file,
  geom_data,
  instance_dim_name = NULL,
  variables = list()
)
write_geometry(
  nc_file,
  geom_data,
  instance_dim_name = NULL,
  variables = list()
)

Arguments

`nc_file`	`character` file path to the nc file to be created.
`geom_data`	sf `data.frame` with POINT, LINESTRING, MULTILINESTRING, POLYGON, or MULTIPOLYGON geometries. Note that three dimensional geometries are not supported.
`instance_dim_name`	`character` Not required if adding geometry to a NetCDF-CF Discrete Sampling Geometries timeSeries file. For a new file, will use package default – "instance" – if not supplied.
`variables`	`character` If a an existing netCDF files is provided, this list of variables that should be related to the geometries.

References

http://cfconventions.org/cf-conventions/cf-conventions.html

Examples


hucPolygons <- sf::read_sf(system.file('extdata','example_huc_eta.json', package = 'ncdfgeom'))

hucPolygons_nc <- ncdfgeom::write_geometry(nc_file=tempfile(), 
                                           geom_data = hucPolygons)
try({
  ncdump <- system(paste("ncdump -h", hucPolygons_nc), intern = TRUE)
  cat(ncdump ,sep = "\n")
}, silent = TRUE)

hucPolygons <- sf::read_sf(system.file('extdata','example_huc_eta.json', package = 'ncdfgeom'))

hucPolygons_nc <- ncdfgeom::write_geometry(nc_file=tempfile(), 
                                           geom_data = hucPolygons)
try({
  ncdump <- system(paste("ncdump -h", hucPolygons_nc), intern = TRUE)
  cat(ncdump ,sep = "\n")
}, silent = TRUE)

Write time series to NetCDF-CF

Description

This function creates a timeseries discrete sampling geometry NetCDF file. It uses the orthogonal array encoding to write one data.frame per function call. This encoding is best suited to data with the same number of timesteps per instance (e.g. geometry or station).

Usage

write_timeseries_dsg(
  nc_file,
  instance_names,
  lats,
  lons,
  times,
  data,
  alts = NA,
  data_unit = "",
  data_prec = "double",
  data_metadata = list(name = "data", long_name = "unnamed data"),
  time_units = "days since 1970-01-01 00:00:00",
  instance_dim_name = "instance",
  dsg_timeseries_id = "instance_name",
  coordvar_long_names = list(instance = "Station Names", time = "time of measurement",
    lat = "latitude of the measurement", lon = "longitude of the measurement", alt =
    "altitude of the measurement"),
  attributes = list(),
  add_to_existing = FALSE,
  overwrite = FALSE
)
write_timeseries_dsg(
  nc_file,
  instance_names,
  lats,
  lons,
  times,
  data,
  alts = NA,
  data_unit = "",
  data_prec = "double",
  data_metadata = list(name = "data", long_name = "unnamed data"),
  time_units = "days since 1970-01-01 00:00:00",
  instance_dim_name = "instance",
  dsg_timeseries_id = "instance_name",
  coordvar_long_names = list(instance = "Station Names", time = "time of measurement",
    lat = "latitude of the measurement", lon = "longitude of the measurement", alt =
    "altitude of the measurement"),
  attributes = list(),
  add_to_existing = FALSE,
  overwrite = FALSE
)

Arguments

`nc_file`	`character` file path to the nc file to be created.
`instance_names`	`character` or `numeric` vector of names for each instance (e.g. station or geometry) to be added to the file.
`lats`	`numeric` vector of latitudes
`lons`	`numeric` vector of longitudes
`times`	`POSIXct` vector of times. Must be of type `POSIXct` or an attempt to convert it will be made using `as.POSIXct(times)`.
`data`	`data.frame` with each column corresponding to an instance. Rows correspond to time steps. nrow must be the same length as times. Column names must match instance names.
`alts`	`numeric` vector of altitudes (m above sea level) (Optional)
`data_unit`	`character` vector of data units. Length must be the same as number of columns in `data` parameter.
`data_prec`	`character` precision of observation data in NetCDF file. Valid options: 'short' 'integer' 'float' 'double' 'char'.
`data_metadata`	`list` A named list of strings: list(name='ShortVarName', long_name='A Long Name')
`time_units`	`character` units string in udunits format to use for time. Defaults to 'days since 1970-01-01 00:00:00'
`instance_dim_name`	the `character` name to use for the instance used in 'instance_names'
`dsg_timeseries_id`	the `character` name to use for the instance used in the timeseries id
`coordvar_long_names`	`list` values for long names on coordinate variables. Names should be 'instance', time', 'lat', 'lon', and 'alt.'
`attributes`	list An optional list of attributes that will be added at the global level. See details for useful attributes.
`add_to_existing`	`boolean` If TRUE and the file already exists, variables will be added to the existing file. See details for more.
`overwrite`	boolean unless set to true, error if file exists.

Details

Suggested Global Variables: c(title = "title", abstract = "history", provider site = "institution", provider name ="source", description = "description")

Note regarding add_to_existing: add_to_existing = TRUE should only be used to add variables to an existing NetCDF discrete sampling geometry file. All other inputs should be the same as are already in the file. If the functions is called with add_to_existing=FALSE (the default), it will overwrite an existing file with the same name. The expected usage is to call this function repeatedly only changing the data, data_unit, data_prec and data_metadata inputs.

See the timeseries vignette for more information.

Package 'ncdfgeom'

Help Index

Area Weighted Intersection

Description

Usage

Arguments

Details

Value

Examples

Create Cell Geometry

Description

Usage

Arguments

Details

Examples

Read attribute dataframe from NetCDF-DSG file

Description

Usage

Arguments

Examples

Read NetCDF-CF spatial geometries

Description

Usage

Arguments

Value

References

Examples

Read NetCDF-CF timeSeries featuretype

Description

Usage

Arguments

Details

Value

References

Write attribute data to NetCDF-CF

Description

Usage

Arguments

Examples

Write geometries and attributes to NetCDF-CF

Description

Usage

Arguments

References

Examples

Write time series to NetCDF-CF

Description

Usage

Arguments

Details

References