dictdump: Dumping and loading dictionaries

This module offers a set of functions to dump a python dictionary indexed by text strings to following file formats: HDF5, INI, JSON

dicttoh5(treedict, h5file, h5path='/', mode='w', overwrite_data=False, create_dataset_args=None)[source]

Write a nested dictionary to a HDF5 file, using keys as member names.

If a dictionary value is a sub-dictionary, a group is created. If it is any other data type, it is cast into a numpy array and written as a h5py dataset. Dictionary keys must be strings and cannot contain the / character.

If dictionary keys are tuples they are interpreted to set h5 attributes. The tuples should have the format (dataset_name,attr_name)

Note

This function requires h5py to be installed.

Parameters:
  • treedict – Nested dictionary/tree structure with strings or tuples as keys and array-like objects as leafs. The "/" character can be used to define sub trees. If tuples are used as keys they should have the format (dataset_name,attr_name) and will add a 5h attribute with the corresponding value.
  • h5file – HDF5 file name or handle. If a file name is provided, the function opens the file in the specified mode and closes it again before completing.
  • h5path – Target path in HDF5 file in which scan groups are created. Default is root ("/")
  • mode – Can be "r+" (read/write, file must exist), "w" (write, existing file is lost), "w-" (write, fail if exists) or "a" (read/write if exists, create otherwise). This parameter is ignored if h5file is a file handle.
  • overwrite_data – If True, existing groups and datasets can be overwritten, if False they are skipped. This parameter is only relevant if h5file_mode is "r+" or "a".
  • create_dataset_args – Dictionary of args you want to pass to h5f.create_dataset. This allows you to specify filters and compression parameters. Don’t specify name and data.

Example:

from silx.io.dictdump import dicttoh5

city_area = {
    "Europe": {
        "France": {
            "Isère": {
                "Grenoble": 18.44,
                ("Grenoble","unit"): "km2"
            },
            "Nord": {
                "Tourcoing": 15.19,
                ("Tourcoing","unit"): "km2"
            },
        },
    },
}

create_ds_args = {'compression': "gzip",
                  'shuffle': True,
                  'fletcher32': True}

dicttoh5(city_area, "cities.h5", h5path="/area",
         create_dataset_args=create_ds_args)
dicttonx(treedict, h5file, h5path='/', mode='w', overwrite_data=False, create_dataset_args=None)[source]

Write a nested dictionary to a HDF5 file, using string keys as member names. The NeXus convention is used to identify attributes with "@" character, therefor the dataset_names should not contain "@".

Parameters:treedict – Nested dictionary/tree structure with strings as keys and array-like objects as leafs. The "/" character can be used to define sub tree. The "@" character is used to write attributes.

Detais on all other params can be found in doc of dicttoh5.

Example:

import numpy
from silx.io.dictdump import dicttonx

gauss = {
    "entry":{
        "title":u"A plot of a gaussian",
        "plot": {
            "y": numpy.array([0.08, 0.19, 0.39, 0.66, 0.9, 1.,
                          0.9, 0.66, 0.39, 0.19, 0.08]),
            "x": numpy.arange(0,1.1,.1),
            "@signal": "y",
            "@axes": "x",
            "@NX_class":u"NXdata",
            "title:u"Gauss Plot",
         },
         "@NX_class":u"NXentry",
         "default":"plot", 
    }
    "@NX_class": u"NXroot",
    "@default": "entry",
}

dicttonx(gauss,"test.h5")
h5todict(h5file, path='/', exclude_names=None, asarray=True)[source]

Read a HDF5 file and return a nested dictionary with the complete file structure and all data.

Example of usage:

from silx.io.dictdump import h5todict

# initialize dict with file header and scan header
header94 = h5todict("oleg.dat",
                    "/94.1/instrument/specfile")
# add positioners subdict
header94["positioners"] = h5todict("oleg.dat",
                                   "/94.1/instrument/positioners")
# add scan data without mca data
header94["detector data"] = h5todict("oleg.dat",
                                     "/94.1/measurement",
                                     exclude_names="mca_")

Note

This function requires h5py to be installed.

Note

If you write a dictionary to a HDF5 file with dicttoh5() and then read it back with h5todict(), data types are not preserved. All values are cast to numpy arrays before being written to file, and they are read back as numpy arrays (or scalars). In some cases, you may find that a list of heterogeneous data types is converted to a numpy array of strings.

Parameters:
  • h5file – File name or h5py.File object or spech5 file or fabioh5 file.
  • path (str) – Name of HDF5 group to use as dictionary root level, to read only a sub-group in the file
  • exclude_names (List[str]) – Groups and datasets whose name contains a string in this list will be ignored. Default is None (ignore nothing)
  • asarray (bool) – True (default) to read scalar as arrays, False to read them as scalar
Returns:

Nested dictionary

dicttojson(ddict, jsonfile, indent=None, mode='w')[source]

Serialize ddict as a JSON formatted stream to jsonfile.

Parameters:
  • ddict – Dictionary (or any object compatible with json.dump).
  • jsonfile – JSON file name or file-like object. If a file name is provided, the function opens the file in the specified mode and closes it again.
  • indent – If indent is a non-negative integer, then JSON array elements and object members will be pretty-printed with that indent level. An indent level of 0 will only insert newlines. None (the default) selects the most compact representation.
  • mode – File opening mode (w, a, w+…)
dicttoini(ddict, inifile, mode='w')[source]

Output dict as configuration file (similar to Microsoft Windows INI).

Parameters:
  • dict – Dictionary of configuration parameters
  • inifile – INI file name or file-like object. If a file name is provided, the function opens the file in the specified mode and closes it again.
  • mode – File opening mode (w, a, w+…)
dump(ddict, ffile, mode='w', fmat=None)[source]

Dump dictionary to a file

Parameters:
  • ddict – Dictionary with string keys
  • ffile – File name or file-like object with a write method
  • fmat (str) –

    Output format: "json", "hdf5" or "ini". When None (the default), it uses the filename extension as the format. Dumping to a HDF5 file requires h5py to be installed.

  • mode (str) – File opening mode (w, a, w+…) Default is “w”, write mode, overwrite if exists.
Raises:

IOError – if file format is not supported

load(ffile, fmat=None)[source]

Load dictionary from a file

When loading from a JSON or INI file, an OrderedDict is returned to preserve the values’ insertion order.

Parameters:
  • ffile – File name or file-like object with a read method
  • fmat

    Input format: json, hdf5 or ini. When None (the default), it uses the filename extension as the format. Loading from a HDF5 file requires h5py to be installed.

Returns:

Dictionary (ordered dictionary for JSON and INI)

Raises:

IOError – if file format is not supported