Add TokenGT model #9834

michailmelonas · 2024-12-09T18:59:48Z

PyG implementation of the Tokenized Graph Transformer following "Pure Transformers are Powerful Graph Learners" (https://arxiv.org/pdf/2207.02505). Includes support for both Laplacian eigenvectors and ORF node identifiers (implemented via a simple data Transform object). A graph regression example is also included.

For a detailed blog post about the implementation, see https://medium.com/stanford-cs224w/pyg-implementation-tokengt-e4aa74dc867b.

michailmelonas · 2025-01-14T08:01:03Z

@wsad1 @EdisonLeeeee @akihironitta any thoughts on when this contribution will get reviewed? :)

puririshi98 · 2025-01-14T20:56:37Z

@michailmelonas this is cool, I'll review and help merge as soon as my time allows.

puririshi98 · 2025-01-15T17:30:12Z

this looks good, will do a deep review soon

puririshi98 · 2025-01-15T17:55:04Z

This is good at a high level. However, I want to see how it compares to existing work. Can you please update this example:
https://github.com/pyg-team/pytorch_geometric/blob/master/examples/ogbn_train.py#L31
to have a "--gnn-choice" argparse option with choices ["sage", "gat", "tokengt_graph_transformer"], and run all three in your environment to see how they compare? Please make the one with the highest test accuracy the default. I can review a little closer once that initial test is done.
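For illustration, the requested option could be sketched roughly as follows. This is a minimal standalone sketch, not the actual contents of ogbn_train.py: the helper name `build_parser` is hypothetical, and the default of "sage" is only a placeholder, since the thread asks for the default to be whichever model scores the highest test accuracy.

```python
import argparse

# Choices as requested in the review comment above.
GNN_CHOICES = ["sage", "gat", "tokengt_graph_transformer"]


def build_parser(default_gnn="sage"):
    """Build a parser exposing --gnn-choice (hypothetical helper).

    `default_gnn` is a placeholder: it should be set to whichever model
    gives the highest test accuracy once all three have been run.
    """
    parser = argparse.ArgumentParser(description="Model selection sketch")
    parser.add_argument(
        "--gnn-choice",
        type=str,
        default=default_gnn,
        choices=GNN_CHOICES,
        help="Which model to train.",
    )
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args()
    # argparse maps "--gnn-choice" to the attribute "gnn_choice".
    print(f"Selected model: {args.gnn_choice}")
```

Because `choices` is set, argparse rejects any value outside the three listed names before training starts.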

michailmelonas · 2025-01-15T19:02:02Z

@puririshi98 sure thing, will do asap.

puririshi98 · 2025-01-23T04:18:46Z

@michailmelonas lmk when ready for further review

michailmelonas · 2025-01-25T12:49:22Z

@puririshi98 apologies for only getting back to you now - have been swamped at work.

TokenGT requires specifying n_nodes orthogonal vectors ("node identifiers"). This is infeasible for the ogbn-papers100M dataset, which has over 100M nodes. Therefore, rather than amending ogbn_train.py, I added token_gt_ogbn.py: a script that makes it easy to benchmark TokenGT against GCN on the ogbg-molhiv dataset (ideally, I'd like to run the model on PCQM4Mv2 as in the paper, but given my computational resources this was the best I could do). Running said script, I get slightly worse (but comparable) results for TokenGT vs. GCN: TokenGT reaches a validation ROC-AUC of 0.774 and GCN reaches 0.819.
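To make the scaling argument concrete, here is a rough pure-Python sketch. It is not the transform from this PR: `orthonormal_node_identifiers` and `dense_identifier_bytes` are hypothetical names, and Gram-Schmidt on Gaussian vectors is used here only as one simple way to obtain random orthonormal vectors. The point is that n mutually orthogonal vectors need at least n dimensions, so storing them densely takes n × n values.

```python
import random


def orthonormal_node_identifiers(n_nodes, seed=0):
    """Return n_nodes mutually orthonormal vectors of length n_nodes.

    Illustrative only: orthonormalizes random Gaussian vectors with
    Gram-Schmidt (two projection passes for numerical stability).
    """
    rng = random.Random(seed)
    basis = []
    while len(basis) < n_nodes:
        v = [rng.gauss(0.0, 1.0) for _ in range(n_nodes)]
        for _ in range(2):
            for b in basis:
                dot = sum(x * y for x, y in zip(v, b))
                v = [x - dot * y for x, y in zip(v, b)]
        norm = sum(x * x for x in v) ** 0.5
        if norm < 1e-8:
            continue  # degenerate draw; sample again
        basis.append([x / norm for x in v])
    return basis


def dense_identifier_bytes(n_nodes, bytes_per_value=4):
    """Bytes needed to store an n_nodes x n_nodes dense matrix."""
    return n_nodes * n_nodes * bytes_per_value


if __name__ == "__main__":
    ids = orthonormal_node_identifiers(4)
    print(len(ids), len(ids[0]))
    # Assumes exactly 100M nodes as a lower bound and float32 storage.
    print(dense_identifier_bytes(100_000_000))
```

Under these assumptions (100M nodes as a lower bound, 4 bytes per value), the dense identifier matrix alone would need 4 × 10^16 bytes, roughly 40 petabytes, which is why a graph-level dataset with small graphs is the more practical benchmark.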

puririshi98 · 2025-01-28T01:33:52Z

I think, as a sanity check to get this merged, you should make an example that uses some open-source dataset (check RelBench or OGB) to show higher accuracy than GCN and SAGE (with an argparser to choose between the three, defaulting to your graph transformer). It will be a good research experience for you.

michailmelonas · 2025-01-28T19:25:47Z

Okay, will do. Will most likely only get to this next week. Apologies that this is dragging.

codecov · 2025-01-31T18:42:03Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 86.39%. Comparing base (aa6cf80) to head (e5db369).

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #9834      +/-   ##
==========================================
- Coverage   86.79%   86.39%   -0.41%     
==========================================
  Files         490      492       +2     
  Lines       32436    32594     +158     
==========================================
+ Hits        28154    28159       +5     
- Misses       4282     4435     +153


puririshi98 · 2025-02-03T23:35:43Z

> Okay, will do. Will most likely only get to this next week. Apologies that this is dragging.

No problem, looking forward to seeing what you can do :)

puririshi98 · 2025-02-20T01:28:32Z

Checking in @michailmelonas, how's it going?

puririshi98 · 2025-03-01T00:57:57Z

c2bbb41

please ensure you follow this

michailmelonas · 2025-03-02T17:39:40Z

@puririshi98 really sorry for the late response - between coursework (https://web.stanford.edu/class/cs234/) and working full time I've not had a chance to get to this. I think the PCQM4Mv2 dataset (https://ogb.stanford.edu/docs/lsc/pcqm4mv2/) would be best to benchmark TokenGT against GCN/GraphSAGE (this is what was used in the original paper). I've obtained some GPU credits to run this experiment. Realistically, I won't be able to start with this in the next two weeks, but will get to it asap post 18/03.

michailramp added 9 commits December 9, 2024 09:01

Add TokenGT implementation

a3e5310

Extend example to case of single graph

748b329

Update docstring

d98599a

Expand docstrings

4943a9d

Fix typo in docstring

acc9dfa

Move period

3911c00

Add docstrings and comments

930938b

Add example of graph regression using TokenGT

6fb3e58

Update readme and changelog docs

3fbb465

michailmelonas requested review from wsad1 and EdisonLeeeee as code owners December 9, 2024 18:59

michailramp added 5 commits December 9, 2024 21:07

Add PR details to changelog entry

2f2bc78

Add asserts to avoid linting failure

5f6c41d

Merge branch 'master' of personal:michailmelonas/pytorch_geometric in…

58fa8ed

…to add-token-gt

Merge branch 'master' of personal:michailmelonas/pytorch_geometric in…

c1f55a0

…to add-token-gt

Merge branch 'master' of personal:michailmelonas/pytorch_geometric in…

2f388b4

…to add-token-gt

akihironitta added feature nn transform labels Dec 30, 2024

michailramp added 4 commits January 6, 2025 08:32

Merge branch 'master' of personal:michailmelonas/pytorch_geometric in…

57ae5dc

…to add-token-gt

Merge branch 'master' of personal:michailmelonas/pytorch_geometric in…

638851d

…to add-token-gt

Merge branch 'master' of personal:michailmelonas/pytorch_geometric in…

f366c9e

…to add-token-gt

Resolve conflicts

c9b4ff4

Merge branch 'master' into add-token-gt

95ba4d5

puririshi98 self-requested a review January 14, 2025 20:56

michailramp added 2 commits January 15, 2025 09:34

Rebase and fix conflict

1465d91

Merge branch 'add-token-gt' of personal:michailmelonas/pytorch_geomet…

2f7affc

…ric into add-token-gt

michailramp and others added 2 commits January 18, 2025 14:45

Merge branch 'master' of personal:michailmelonas/pytorch_geometric in…

0b776cf

…to add-token-gt

Merge branch 'master' into add-token-gt

27afafd

puririshi98 mentioned this pull request Jan 24, 2025

Graph Transformer Enhancement #9751

Open

michailramp added 3 commits January 25, 2025 10:27

Rebase and resolve conflict

c169cd0

Merge branch 'add-token-gt' of personal:michailmelonas/pytorch_geomet…

d99206b

…ric into add-token-gt

Reformat

e1e0d13

michailramp and others added 2 commits January 26, 2025 14:06

Merge branch 'master' of personal:michailmelonas/pytorch_geometric in…

1d664ca

…to add-token-gt

Merge branch 'master' into add-token-gt

4cb7811

Merge branch 'master' into add-token-gt

e5db369

Merge branch 'master' into add-token-gt

ee57934

Merge branch 'master' into add-token-gt

cf9bb9c

michailramp added 2 commits March 2, 2025 18:59

Rebase with remote

5b89c55

Rebase with remote

28f83f3
