## Data Sketch Library
This is an open-source python library that provides Python implementation of sketching algorithms and other algorithms for large data processing.
Currently, we provide some sketching and filtering algorithms such as
- Heavy Hitter Finder
- Distinct Element Count
- Second Frequency Moment Estimation
- Reservoir Sampling
- Bloom Filter
[Source Code] (https://github.com/db-gb/DataStreamSketches)
[PyPi] (https://pypi.org/project/sketchlib/)
If you are interested in contributing, please send an email to hvu2 at sdsu dot edu.
## Contributors:
- Hoa T. Vu
- Daniel Barnas