Normalization constants for edge features during inference

Hi @vxfung! 

Hoping you are well!

Just wanted to bring to notice a minor issue I faced while trying to integrate MatDeepLearn with one of my custom packages. Currently, while processing the ASE structures, edge normalization happens as per the [interatomic distance distribution of the input data features](https://github.com/Fung-Lab/MatDeepLearn/blob/31d49d48cce932e620a4c5de82575ae209304b60/matdeeplearn/process/process.py#LL647C32-L647C32). 

However, for the prediction or inference phase, this should not be the case. Especially, I found that for smaller datasets where crystal structures were similar, the normalization constants could be different from what were used for learning, and that resulted in inconsistent predictions. This is likely to happen only in cases where we have a set of crystal structures with interatomic distances less than the ones set in graph processing; and therefore, only when you are predicting properties for a small set of similar structures. 

A couple of proposed fixes:
- Set the Min-Max normalization constants to be the same as what is set in the graph processing config
OR
- Enforce it as a required field for inference/prediction jobs

Cheers!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Normalization constants for edge features during inference #13

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Normalization constants for edge features during inference #13

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions