Medical Search Engine with SPLADE + Sentence Transformers in Python

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!

Grab it

Learn how to build a medical search engine using hybrid search with NLP information retrieval models in Python. Explore the implementation of hybrid search combining sentence transformers and SPLADE for medical question-answering. Discover how to leverage both dense and sparse vectors to cover semantics and enable exact matching and keyword search. Dive into SPLADE, a powerful sparse embedding method outperforming BM25, and learn how it minimizes vocabulary mismatch problems. Follow along with a practical demo using SPLADE and a sentence transformer model trained on MS-MARCO, implemented via Hugging Face transformers. Gain hands-on experience with the Pinecone vector database for the search component, supporting SPLADE vectors natively. Cover topics including data preprocessing, creating dense and sparse vector embeddings, preparing data for Pinecone, creating a sparse-dense index, and making hybrid search queries.

Syllabus

Hybrid search for medical field
Hybrid search process
Prerequisites and Installs
Pubmed QA data preprocessing step
Creating dense vectors with sentence-transformers
Creating sparse vector embeddings with SPLADE
Preparing sparse-dense format for Pinecone
Creating the Pinecone sparse-dense index
Making hybrid search queries
Final thoughts on sparse-dense with SPLADE

Taught by

James Briggs

Reviews

Start your review of Medical Search Engine with SPLADE + Sentence Transformers in Python

Taught by

Supercharge eCommerce Search - OpenAI's CLIP, BM25, and Python

SPLADE - The First Search Model to Beat BM25

NER Powered Semantic Search in Python

How to Use OpenAI Whisper to Fix YouTube Search

Advanced Sentiment Analysis with NLP Transformers and Vector Search

Better Semantic Search with Hybrid Sparse-Dense Search - Haystack US 2023

10 Best Machine Learning Courses for 2024: Scikit-learn, TensorFlow, and more

10 Best Python Courses for 2024: Charming the Snake

Never Stop Learning.