Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Min Hashing in Data Mining - Spring 2023

UofU Data Science via YouTube

Overview

Learn about Min Hashing techniques in data mining through this comprehensive lecture recording that covers essential concepts and practical implementations. Begin with foundational discussions on plagiarism policies and project requirements before diving into the core material. Explore the motivation behind Min Hashing, understand characteristic matrices, and master Min Hash value calculations through guided exercises. Examine crucial properties and improved estimation methods before transitioning to practical fast Min Hash implementations with detailed examples. Conclude by learning how to estimate Jaccard Similarity from signatures and make informed decisions about choosing appropriate k values. Practice with hands-on exercises throughout the lecture to reinforce understanding of both theoretical concepts and real-world applications.

Syllabus

Recording starts
Lecture starts
Announcement Piazza
Plagiarism/cheating policy
Projects
Motivation
Characteristic matrix
Minhash value
Minhash exercise
Crucial property
Better estimate
Fast minhash actual implementation
Fast minhash example
Fast minhash exercise
Estimating JS from signatures
Choosing k
Lecture ends

Taught by

UofU Data Science

Reviews

Start your review of Min Hashing in Data Mining - Spring 2023

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.