Skip to content

Revise Serialized Model Structure for Standardization #40

@Gnpd

Description

@Gnpd

Review and refine the structure of the serialized model to align with possible existing serialization standards. The current format includes fields such as estimator_class, attributes, params,... and metadata like producer_version. Explore established standards (e.g., ONNX, PMML) to ensure compatibility and extensibility. Propose a revised structure that balances clarity, completeness, and interoperability.

Current format

serialized_estimator = {
    "estimator_class": model.__class__.__name__,  #str
    "attributes": serializable_attributes, #dict  loadings: [1,2,3]
    "attribute_types": attribute_types, #dict ndarray
    "attribute_dtypes": attribute_dtypes, #dict float64
    "params": self.convert_to_serializable(params), #dict
    "param_types": param_types, #dict
    "param_dtypes": param_dtypes, #dict
    "producer_version": getattr(model, "_sklearn_version", None),
    "producer_name": model.__module__.split(".")[0],
    "domain": "sklearn",
}

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    Status

    Backlog

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions