u/laaweel I have another quick question. Cora dataset splits the labels into 140 trains, 500 for valid and 1000 for test (according to DGL website). I found that these numbers correspond to the number of nodes (node classification problem). But any thought why the sum (140+500+1000) does not match the total node number in Cora dataset (2708 nodes)? Is it because the rest of the nodes are unlabeled? Thank you!
ramya_1995 OP t1_j56ovcx wrote
Reply to comment by laaweel in [D] GCN datasets by ramya_1995
u/laaweel I have another quick question. Cora dataset splits the labels into 140 trains, 500 for valid and 1000 for test (according to DGL website). I found that these numbers correspond to the number of nodes (node classification problem). But any thought why the sum (140+500+1000) does not match the total node number in Cora dataset (2708 nodes)? Is it because the rest of the nodes are unlabeled? Thank you!