Computing shared edges in two graphs

This project is for use in the course DA3018.

This project contains a program called snitt.py that takes two files as input. The files are supposed to contain one graph each, stored as lists of edges: each line is a pair of vertex names. I have two very large graphs I want to compare and the program is therefore designed to be sparing with memory, although that is not something I have worked hard to optimize. The idea is that I store only one of the graphs in memory and then just read the edges from the other file to check whether they are present in graph 1 or not. The output is a count of shared and unique edges.

Currently, there is something wrong with the program. I have two small test files, t1.txt and t2.txt, that share two edges and have one unique edge each.

There are also two larger test files, pairs1.txt and pairs2.txt if you want to work with that. They are taken from the original graph data, which contained roughly 80 million and 50 million edges.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Computing shared edges in two graphs

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
README.md		README.md
pairs1.txt		pairs1.txt
pairs2.txt		pairs2.txt
snitt.py		snitt.py
t1.txt		t1.txt
t2.txt		t2.txt

arvestad/snitt

Folders and files

Latest commit

History

Repository files navigation

Computing shared edges in two graphs

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages