DataMining

HW1: Finding Similar Items: You are to implement the stages of finding textually similar documents based on Jaccard similarity using the shingling, minhashing, and locality-sensitive hashing (LSH) techniques and corresponding algorithms. PySpark code

HW2: Discovery of Frequent Itemsets and Association Rules Implement the Apriori algorithm for finding frequent itemsets with support at least s in a dataset of transactions. Python and PySpark code

HW3: Implementation of "L. De Stefani, A. Epasto, M. Riondato, and E. Upfal: TRIÈST: Counting Local and Global Triangles in Fully-Dynamic Streams with Fixed Memory Size (KDD'16) with Python.

HW4: Implementation of "On Spectral Clustering: Analysis and an algorithm” by Andrew Y. Ng, Michael I. Jordan, Yair Weiss. Matlab and Python code

HW5: Implementation of "F. Rahimian, et al., JA-BE-JA: A Distributed Algorithm for Balanced Graph Partitioning, SASO2013"

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
hw1		hw1
hw3		hw3
hw4		hw4
hw5		hw5
hw_2		hw_2
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
hw-2.ipynb		hw-2.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DataMining

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

DataMining

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages