NOAA   ERDDAP   Easier access to scientific data Brought to you by NOAA NMFS SFSC ERD    

Working with the datasets.xml File

[This web page will only be of interest to ERDDAP administrators.]
After you have followed the ERDDAP installation instructions, you must edit the datasets.xml file
in <tomcat>/content/erddap/ to describe the datasets that your ERDDAP installation will serve.

Some Assembly Required - Setting up a dataset in ERDDAP isn't just a matter of pointing to the dataset's directory or URL.
You have to write a chunk of XML for datasets.xml which describes the dataset.

If you buy into these ideas and expend the effort to create the XML for datasets.xml,
you get all the advantages of ERDDAP, including: Making the datasets.xml takes considerable effort for the first few datasets, but it gets easier.
After the first dataset, you can use often re-use a lot of your work for the next dataset.
Fortunately, there are two Tools to help you create the XML for each dataset in datasets.xml.

Tools - There are two command line programs which are tools to help you create the XML
for each dataset that you want your ERDDAP to serve.
Once you have ERDDAP installed in Tomcat and Tomcat has unpacked the erddap.war file,
you can find these programs in the <tomcat>/webapps/erddap/WEB-INF directory.
There are Linux/Unix shell scripts (the program name, with no extension) and
Windows .bat files for each program.
When you run each program, it will ask you questions.
For each question, type a response, then press Enter. (How quaint! It's just like the old days.)
Or press ^C to exit a program at any time.

The two tools are a big help, but you still must read all of these instructions on this page carefully
and make important decisions yourself.

The basic structure of the datasets.xml file is:

<?xml version="1.0" encoding="ISO-8859-1" ?>
<erddapDatasets>
    <dataset>...</dataset> <!-- 1 or more -->
    <user username="..." password="..." roles="..." /> <!-- 0 or more -->
    <requestBlacklist>...</requestBlacklist> <!-- 0 or 1 -->
    <subscriptionEmailBlacklist>...</subscriptionEmailBlacklist> <!-- 0 or 1 -->
</erddapDatasets>
It is possible that other encodings will be allowed in the future, but for now, only ISO-8859-1 is recommended.

Several types of datasets are supported. They fall into two categories. (Why?)

See also the Tag and Attribute Descriptions.

Notes

Working with the datasets.xml file is a non-trivial project. Please read this entire web page carefully, especially these notes.

Skeleton XML for each Type of Dataset

EDDGridFromDap handles grid variables from DAP servers.

EDDGridFromErddap handles gridded data from a remote ERDDAP server.
EDDTableFromErddap handles tabular data from a remote ERDDAP server.

EDDGridFromEtopo just serves the ETOPO topography data which is distributed with ERDDAP.

EDDGridFromFiles is the superclass of all EDDGridFrom...Files classes.
You can't use EDDGridFromFiles directly.
Instead, you can use subclasses of EDDGridFromFiles to handle specific file types:

It should be relatively easy to support other file types. Contact us if you have requests.

Details - The following information applies to all of the subclasses of EDDGridFromFiles.

EDDGridFromNcFiles aggregates data from local, gridded, GRIB .grb and .grb2 files, HDF .hdf 4 (and 5?) files, NetCDF .nc files.
This may work with other file types (e.g., BUFR), we just haven't tested it -- please send us some sample files.
See this class' superclass, EDDGridFromFiles, for details.

Note that for GRIB files, ERDDAP will make a .gbx index file the first time it reads each GRIB file.
So the GRIB files must be in a directory where the "user" that ran Tomcat has read+write permission.

EDDGridSideBySide aggregates two or more EDDGrid datasets (the children) side by side.

EDDGridAggregateExistingDimension aggregates two or more EDDGrid datasets based on different values of the first dimension.

EDDGridCopy makes and maintains a local copy of another EDDGrid's data and serves data from the local copy.

EDDTableFromBMDE handles data from a BMDE server.

EDDTableFromDapSequence handles variables within 1- and 2-level sequences from DAP servers such as DAPPER.

EDDTableFromDatabase handles data from one database table or view.

EDDTableFromFiles is the superclass of all EDDTableFrom...Files classes.
You can't use EDDTableFromFiles directly.
Instead, you can use subclasses of EDDTableFromFiles to handle specific file types:

Details - The following information applies to all of the subclasses of EDDTableFromFiles.

EDDTableFromHyraxFiles aggregates data files with several variables, each with one or more shared dimensions (e.g., time, altitude, latitude, longitude), and served by a Hyrax OPeNDAP server.

EDDTableFromNcFiles aggregates data from .nc files with several variables, each with one or more shared dimensions (e.g., time, altitude, latitude, longitude).
In all cases, each file has multiple values for the leftmost dimension, e.g. time.
In all cases, each file has a single value for the other dimensions (e.g., altitude, latitude, longitude).
The files may have character variables with an additional dimension (e.g., nCharacters).
See this class' superclass, EDDTableFromFiles, for details.

EDDTableFromNOS handles data from a NOAA NOS source, which uses SOAP+XML for requests and responses. It is very specific to NOAA NOS's XML. See the sample EDDTableFromNOS dataset in datasets2.xml.

EDDTableFromOBIS handles data from an OBIS server.

EDDTableFromSOS handles data from a Sensor Observation Service (SWE/SOS) server.

EDDTableCopy makes and maintains a local copy of another EDDTable's data and serves data from the local copy.


Tag and Attribute Descriptions

Here are descriptions of some of the tags which are used by more than one dataset type.

DISCLAIMER OF ENDORSEMENT
Any reference obtained from this server to a specific commercial product, process, or service does not constitute or imply an endorsement by CoastWatch, NOAA, or the United States Government of the product, process, or service, or its producer or provider. The views and opinions expressed in any referenced document do not necessarily state or reflect those of CoastWatch, ERD, NOAA, or the United States Government.

DISCLAIMER FOR EXTERNAL LINKS
The appearance of external links on this World Wide Web site does not constitute endorsement by the Department of Commerce/National Oceanic and Atmospheric Administration of external Web sites or the information, products or services contained therein. For other than authorized activities, the Department of Commerce/NOAA does not exercise any editorial control over the information you may find at these locations. These links are provided consistent with the stated purpose of this Department of Commerce/NOAA Web site.

DISCLAIMER OF LIABILITY
Neither the data providers, ERD, CoastWatch, NOAA, nor the United States Government, nor any of their employees or contractors, makes any warranty, express or implied, including warranties of merchantability and fitness for a particular purpose, or assumes any legal liability for the accuracy, completeness, or usefulness, of any information at this site.

USAGE LIMITATIONS
The SeaWiFS images and data from this site may be used for free, but not redistributed; all other images and data from this site may be used and redistributed for free.

CONTACT
Please email questions, comments, or suggestions regarding this web page to bob.simons@noaa.gov. We would really appreciate knowing if you use ERDDAP.
If a data request doesn't work and you think it should, please send the request URL?query (the String from the "Just generate the URL" textfield in the Data Access Form).


NOAA NMFS SFSC ERD Privacy Policy