API Overview¶

Building a data model from an XSD file¶

DataModel: use directly the constructor to create an instance of a data model.

Note

You should always use the DataModel constructor to create a new instance instead of trying to change an instance's attributes, as internal objects are created when the constructor is called.

Inspecting the data model¶

DataModel.source_tree: see the data model in tree format before any transformation
DataModel.target_tree: see the data model in tree format after simplification (corresponding to the data model which will be created in the database)
DataModel.get_entity_rel_diagram: get a visual representation of the data model using Mermaid
DataModel.get_all_create_table_statements: get SQLAlchemy CREATE TABLE statements that can be printed for detailed inspection

Loading data into the database¶

DataModel.parse_xml: read and parse a XML document, which is loaded in memory
Document.insert_into_target_tables: load a file into the database; returns a LoadStats object with inserted/existing row counts and per-phase durations

Advanced use: loading data into the database¶

The flow chart below presents data conversions used to load an XML file into the database, showing the functions used for lower level steps. It can be useful for advanced use cases, for instance:

transforming the data in intermediate steps,
adding logging,
limiting concurrent access to the database in a multiprocessing context, etc.

For those scenarios you can easily reimplement Document.insert_into_target_tables to suit your needs, using lower level functions.

flowchart TB
    subgraph "<a href='../data_model/#xml2db.model.DataModel.parse_xml' style='color:var(--md-code-fg-color)'>DataModel.parse_xml</a>"
        direction TB
        A[XML file]-- "<a href='../xml_converter/#xml2db.xml_converter.XMLConverter.parse_xml' style='color:var(--md-code-fg-color)'>XMLConverter.parse_xml</a>" -->B[Document tree]
        B-- "<a href='../document/#xml2db.document.Document.doc_tree_to_flat_data' style='color:var(--md-code-fg-color)'>Document.doc_tree_to_flat_data</a>" -->C[Flat data model]
    end
    C -.- D
    subgraph "<a href='../document/#xml2db.document.Document.insert_into_target_tables' style='color:var(--md-code-fg-color)'>Document.insert_into_target_tables</a>"
        direction TB
        D[Flat data model]-- "<a href='../document/#xml2db.document.Document.insert_into_temp_tables' style='color:var(--md-code-fg-color)'>Document.insert_into_temp_tables</a>" -->E[Temporary tables]
        E-- "<a href='../document/#xml2db.document.Document.merge_into_target_tables' style='color:var(--md-code-fg-color)'>Document.merge_into_target_tables</a>" -->F[Target tables]
    end

Advanced use: get data from the database back to XML¶

The flow chart below presents data conversions used to get back data from the database into XML, showing the functions used for lower level steps.

flowchart TB
    subgraph "<a href='../data_model/#xml2db.model.DataModel.extract_from_database' style='color:var(--md-code-fg-color)'>DataModel.extract_from_database</a>"
        direction TB
        A[Database]-->B[Flat data model]    
    end
    B -.- C
    subgraph "<a href='../document/#xml2db.document.Document.to_xml' style='color:var(--md-code-fg-color)'>Document.to_xml</a>" 
        direction TB
        C[Flat data model]-- "<a href='../document/#xml2db.document.Document.flat_data_to_doc_tree' style='color:var(--md-code-fg-color)'>Document.flat_data_to_doc_tree</a>" -->D[Document tree]
        D-- "<a href='../xml_converter/#xml2db.xml_converter.XMLConverter.to_xml' style='color:var(--md-code-fg-color)'>XMLConverter.to_xml</a>" -->E[XML file]
    end