EnzymeML was developed to support data acquisition, data analysis, and sharing of data by providing a standardized exchange format for enzymatic data. EnzymeML follows the Standards of Reporting Enzymology Data (STRENDA) Guidelines, a comprehensive set of metadata describing reaction conditions and kinetic models. EnzymeML is written in eXtensible Markup Language (XML).
It builds on the well-established Systems Biology Markup Language (SBML) and includes information about the enzyme, the substrate(s) and product(s), the reaction conditions, the selected kinetic model, and estimated kinetic parameters. The measured time course of substrate or product concentrations is stored in a comma-separated value (CSV) formatted file. The XML- and the CSV file are combined into a single EnzymeML document using the widely-used OMEX format.
The typical user is not expected to read or write EnzymeML documents directly, but to use software to generate EnzymeML documents, which are then used as a standardized exchange format to transfer data between applications. Therefore, an Application Programming Interface (API) was developed to read, write, and edit EnzymeML documents.
The API is a Python library (PyEnzyme on GitHub), which can be installed locally or can be accessed as a RESTful Web service. This Web service can be used by any application such as an electronic laboratory notebook (ELN), a modeling platform, or a specialized database to read or write EnzymeML documents. Upon reading, writing, and editing EnzymeML documents, the API controls data completeness and consistency, such as checking that scalar properties such as pH are within a given range. Additional validation tools control compatibility with SBML or with minimum requirements of applications such as STRENDA DB, SABIO-RK, or COPASI.
An EnzymeML document includes information about the reaction conditions (e.g. enzyme, substrates, products, pH, temperature) and the measured concentrations of substrate or product at different time points. Currently, we provide two alternatives routes on how to create an EnzymeML document:
As the second step, the EnzymeML document generated upon data acquisition is used as starting point for modeling the enzyme kinetics. Currently, we provide two alternative routes on how to model the kinetics from experimental data by using the dedicated COPASI or PySCeS Thin Layer interfaces (see source code). See the following example and scenario from our recent publication for more details.
Finally, the complete data of the enzyme-catalyzed reaction (reaction conditions, time course data, selected kinetic model, estimated kinetic parameters) is uploaded to an EnzymeML Dataverse or a specialized database such as SABIO-RK: