Replies: 7 comments
-
I also tried graphs with random node features. In this case, GNNs outperform MLPs by 2%, but compared with the provided features, overall performance drops by 45%.
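For reference, this ablation is easy to reproduce: swap the provided NLP embeddings for i.i.d. noise of the same shape before training. A minimal sketch, assuming the features are a numpy array; `randomize_features` is a hypothetical helper, not part of the IGB codebase:

```python
import numpy as np

def randomize_features(feat, seed=0):
    """Replace every node's feature vector with i.i.d. Gaussian noise
    of the same shape and dtype, discarding the NLP embeddings."""
    rng = np.random.default_rng(seed)
    return rng.standard_normal(feat.shape).astype(feat.dtype)

# e.g. 1,000 nodes with 1024-dim embeddings
feat = np.ones((1000, 1024), dtype=np.float32)
rand_feat = randomize_features(feat)
```

Training on `rand_feat` instead of `feat` isolates how much of the model's accuracy comes from the graph structure alone.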
-
Hi Mingxuan,
Here are the models: As you mentioned, the MLP performs quite well compared to the GNNs, but for similar runs and model sizes the GNNs do have an edge; this is an interesting observation that should be studied more closely. Regarding your comment about the 45% drop in performance, I just tested that as well and saw a similar drop. We ran this experiment as discussed in our paper and noticed that GNN performance drops significantly when we use synthetic node embeddings instead of node embeddings generated with NLP methods.
-
@akhatua2 can you please check the impact of the number of labeled nodes on MLP vs GNN performance? Perhaps having more labeled nodes helps the MLP; I don't recall testing this. The other experiment worth studying is reducing the embedding dimension and seeing its impact.
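The labeled-node experiment only needs the training mask subsampled at several budgets before each run. A minimal sketch assuming a boolean numpy training mask; `subsample_train_mask` is a hypothetical helper, not an IGB API:

```python
import numpy as np

def subsample_train_mask(train_mask, frac, seed=0):
    """Keep a random `frac` of the labeled (True) training nodes,
    so MLP vs GNN can be compared at different label budgets."""
    rng = np.random.default_rng(seed)
    idx = np.flatnonzero(train_mask)
    keep = rng.choice(idx, size=int(len(idx) * frac), replace=False)
    out = np.zeros_like(train_mask)
    out[keep] = True
    return out

mask = np.zeros(100, dtype=bool)
mask[:60] = True                      # 60 labeled nodes
half = subsample_train_mask(mask, 0.5)
```

Sweeping `frac` over, say, {0.1, 0.25, 0.5, 1.0} and retraining both models at each budget would show whether extra labels close the gap for the MLP.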
-
I ran the experiment with GraphSAGE and MLP models across variable embedding dimensions, training until convergence.
Note that the 512-dim embeddings come from a multilingual model (which causes the slight overall drop there). Essentially, the embedding dimension seems to have a major impact on the performance of the MLP model, but we need to study this further.
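A rough harness for such a dimension sweep could look like the following. This is a sketch, not the experiment actually run above: `reduce_dim` uses a random Gaussian projection as a cheap stand-in for re-encoding the text with a smaller NLP model, and `train_and_eval` is a placeholder for whatever MLP/GraphSAGE training loop you use:

```python
import numpy as np

def reduce_dim(feat, dim, seed=0):
    """Project the embeddings down to `dim` columns with a random
    Gaussian matrix (approximately norm-preserving after scaling)."""
    rng = np.random.default_rng(seed)
    proj = rng.standard_normal((feat.shape[1], dim)) / np.sqrt(dim)
    return (feat @ proj).astype(feat.dtype)

def sweep(feat, dims, train_and_eval):
    """Run `train_and_eval` once per target embedding width
    and collect the resulting scores."""
    return {d: train_and_eval(reduce_dim(feat, d)) for d in dims}
```

Note that a random projection only tests dimensionality, not embedding quality; results may differ from embeddings natively produced at each width by different NLP models.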
-
This is precisely why the IGB dataset was created: to study the impact of variable-size embeddings. If the community is interested in this sort of study, we should figure out the right APIs to support it; our current APIs do not readily provide this flexibility.
-
Dear authors,
Thanks for the timely and detailed answers! I trained MLPs and GNNs with an early-stopping strategy based on the validation loss, which gives 71% accuracy for GCN and 72% for the MLP. All the MLPs and GNNs I used are two-layer with 64 hidden dimensions. I feel that training for only 10 epochs probably does not fully utilize the capacity of either the MLPs or the GNNs. Could you try increasing the training steps or exploring early-stopping strategies to push the models a little further?
Best,
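For concreteness, the validation-loss early stopping described above can be implemented with a small patience counter; this is a generic sketch, not the exact setup used for the numbers quoted:

```python
class EarlyStopping:
    """Stop training once validation loss has failed to improve
    by `min_delta` for `patience` consecutive evaluations."""
    def __init__(self, patience=10, min_delta=0.0):
        self.patience, self.min_delta = patience, min_delta
        self.best = float("inf")
        self.bad_epochs = 0

    def step(self, val_loss):
        """Record one validation loss; return True to stop training."""
        if val_loss < self.best - self.min_delta:
            self.best, self.bad_epochs = val_loss, 0
        else:
            self.bad_epochs += 1
        return self.bad_epochs >= self.patience
```

Usage inside the epoch loop: `if stopper.step(val_loss): break`, typically combined with restoring the checkpoint from the best-loss epoch.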
-
I updated the results in my last comment. I ran the models until the training accuracy converged. The MLP does seem to perform very well; however, the node embedding dimension has a greater impact on its performance. We hypothesize that the node embeddings generated by the NLP models provide a lot of information, which strongly affects the MLP (which relies only on the embeddings), whereas the GNNs, which also learn from the graph structure, are impacted less. Studying MLP performance with respect to node embedding dimension, embedding model type, and fraction of labelled data is a very interesting experiment and requires more understanding.
Arpan
-
Dear IGB authors,
Thanks for releasing these amazing datasets that help promote the graph community!
I am excited and can't wait to try out some of these datasets. Below is one question I have:
I tried the tiny (100K) and small (1M) versions, and MLPs easily outperform GNNs with the same number of parameters, which is really strange. Usually we would expect a 10-40% performance drop when the graph structure is missing. This phenomenon suggests that the graph structure in this dataset is detrimental to node-classification performance.
Could you please help me resolve my question?
Best,
Mingxuan