Features • Installation • Examples
MGTCOM Framework is meant to be used as a community detection algorithm in temporal multimodal graphs. The framework learns temporally and topologically aware embeddings and detects communities in tandem.
- MGTCOM: Learns temporal and topological embeddings
- MGTCOM Topo: Learns topological embeddings
- MGTCOM Tempo: Learns temporal embeddings
You can use MGTCOM models by either setting it up using Docker or by installing it manually.
- Install Docker and Docker Compose
- Build the Docker image:
docker-compose build
- Run scripts as
docker-compose run mgtcom <script name>
System Requirements:
Note that while conda installs most of the compiled dependencies, the system still needs to have an up-to-date
glibc
version (2.27 and up).Check your version using following command
ldd --version
- Install either Anaconda, Miniconda, or Mamba (recommended).
- Set up the environment:
- mamba env update --prefix=./env --f environment.yml --prune
- For cpu only usage replace
pytorch::cudatoolkit=11.3
withpytorch::cpuonly
in the environment.yml file.
- Activate the environment:
source activate.sh
- Extract preprocessed datasets:
- Download the preprocessed datasets from Drive
mkdir -p ./storage/cache/dataset
cp ./datasets.zip ./storage/cache/dataset/datasets.zip
cd ./storage/cache/dataset && unzip datasets.zip
The executors are found in ml/ml/executors
directory.
python ml/ml/executors/mgcom_combi_executor.py -h
# For docker users
docker-compose run mgtcom ml/ml/executors/mgcom_combi_executor.py -h
python ml/ml/executors/mgcom_combi_executor.py --repr_dim=64
# For docker users
docker-compose run mgtcom ml/ml/executors/mgcom_combi_executor.py --repr_dim=64
Footnotes
-
W. L. Hamilton, R. Ying, and J. Leskovec, “Inductive representation learning on large graphs,” in Proceedings of the 31st International Conference on Neural Information Processing Systems, Red Hook, NY, USA, Dec. 2017, pp. 1025–1035. ↩
-
A. Grover and J. Leskovec, “node2vec: Scalable Feature Learning for Networks,” in Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, New York, NY, USA, Aug. 2016, pp. 855–864. doi: 10.1145/2939672.2939754. ↩
-
S. Cavallari, V. W. Zheng, H. Cai, K. C.-C. Chang, and E. Cambria, “Learning Community Embedding with Community Detection and Node Embedding on Graphs,” in Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, Singapore Singapore, Nov. 2017, pp. 377–386. doi: 10.1145/3132847.3132925. ↩
-
B. Rozemberczki, R. Davies, R. Sarkar, and C. Sutton, “GEMSEC: graph embedding with self clustering,” in Proceedings of the 2019 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, New York, NY, USA, Aug. 2019, pp. 65–72. doi: 10.1145/3341161.3342890. ↩
-
L. Luo, Y. Fang, X. Cao, X. Zhang, and W. Zhang, “Detecting Communities from Heterogeneous Graphs: A Context Path-based Graph Neural Network Model,” in Proceedings of the 30th ACM International Conference on Information & Knowledge Management, New York, NY, USA: Association for Computing Machinery, 2021, pp. 1170–1180. Accessed: Jan. 08, 2022. [Online]. Available: https://doi.org/10.1145/3459637.3482250 ↩
-
G. H. Nguyen, J. B. Lee, R. A. Rossi, N. K. Ahmed, E. Koh, and S. Kim, “Continuous-Time Dynamic Network Embeddings,” in Companion of the The Web Conference 2018 on The Web Conference 2018 - WWW ’18, Lyon, France, 2018, pp. 969–976. doi: 10.1145/3184558.3191526. ↩
-
Y. Dong, N. V. Chawla, and A. Swami, “metapath2vec: Scalable Representation Learning for Heterogeneous Networks,” in Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, New York, NY, USA, Aug. 2017, pp. 135–144. doi: 10.1145/3097983.3098036. ↩
-
J. Yang and J. Leskovec, “Defining and evaluating network communities based on ground-truth,” in Proceedings of the ACM SIGKDD Workshop on Mining Data Semantics, New York, NY, USA, Aug. 2012, pp. 1–8. doi: 10.1145/2350190.2350193. ↩
-
A. García-Durán, S. Dumancic, and M. Niepert, “Learning Sequence Encoders for Temporal Knowledge Graph Completion,” Jan. 2018, pp. 4816–4821. doi: 10.18653/v1/D18-1516. ↩
-
“IMDB 5000 Movie Dataset.” https://kaggle.com/carolzhangdc/imdb-5000-movie-dataset (accessed Jan. 14, 2022). ↩
-
S. Wang, M. Schraagen, E. Tjong Kim Sang, and M. Dastani, “Public Sentiment on Governmental COVID-19 Measures in Dutch Social Media,” presented at the EMNLP-NLP-COVID19 2020, Online, Dec. 2020. doi: 10.18653/v1/2020.nlpcovid19-2.17. ↩
-
Z. Yang, W. W. Cohen, and R. Salakhutdinov, “Revisiting semi-supervised learning with graph embeddings,” in Proceedings of the 33rd International Conference on International Conference on Machine Learning - Volume 48, New York, NY, USA, Jun. 2016, pp. 40–48. ↩ ↩2