Overview
Learn how to process unstructured text data from Reddit and extract organization names with spaCy's Named Entity Recognition (NER) in Python. Explore techniques for identifying stocks that are mentioned in different forms, including organization names and ticker symbols, and for telling tickers apart from other abbreviations and slang. The video walks through importing spaCy, loading a model, working with document objects, and building a list of organizations, then moves on to importing data, extracting entities, filtering the results, and counting them with Counter objects. By the end of the tutorial, you'll be able to automatically assign mentions to the correct stocks for further analysis of Reddit data.
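As a rough illustration of the extraction step described above, here is a minimal sketch, not the instructor's code. It assumes spaCy is installed along with the small English model `en_core_web_sm` (the model choice is an assumption; the video may use a different one). The helper names `extract_orgs` and `demo` and the sample sentence are hypothetical.

```python
def extract_orgs(doc):
    """Return the text of every entity labelled ORG in a spaCy Doc.

    Works on any object that exposes `.ents`, where each entity has
    `.text` and `.label_` attributes, as spaCy's Doc does.
    """
    return [ent.text for ent in doc.ents if ent.label_ == "ORG"]


def demo():
    """Run NER on a made-up sentence; needs spaCy and en_core_web_sm."""
    import spacy  # imported lazily so the helper above works without it

    # Assumption: the small English pipeline; any model with an NER
    # component that emits ORG labels would work the same way.
    nlp = spacy.load("en_core_web_sm")

    # Hypothetical Reddit-style text, not taken from the course data.
    text = "Thinking about buying more Tesla and ARK Invest funds this week."
    doc = nlp(text)

    # Which spans are tagged ORG depends on the model, so no output is
    # promised here.
    print(extract_orgs(doc))
```

Calling `demo()` prints whatever organizations the model finds in the sample text. Results vary by model and version, and ticker symbols or slang are often mislabelled, which is why the later filtering step matters.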
Syllabus
Introduction
Importing SpaCy
SpaCy Models
Load Model
Document Object
Organization List
Importing Data
Getting Entities
Filtering Entities
Counter Object
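The last two syllabus steps, filtering entities and counting them with a Counter object, could look roughly like the sketch below. It is an assumption-laden illustration rather than the course's implementation: the `BLACKLIST` contents and the helper names `clean_orgs` and `count_orgs` are hypothetical, and counting each name at most once per post is a design choice made here, not something the listing specifies.

```python
from collections import Counter

# Hypothetical blacklist of strings an NER model might tag as
# organizations but that are not stocks (slang, regulators, and so on).
BLACKLIST = {"yolo", "ev", "sec", "fda", "covid"}


def clean_orgs(orgs, blacklist=BLACKLIST):
    """Drop empty strings and blacklisted names (case-insensitive)."""
    kept = []
    for org in orgs:
        name = org.strip()
        if name and name.lower() not in blacklist:
            kept.append(name)
    return kept


def count_orgs(org_lists):
    """Count in how many posts each organization name appears.

    `org_lists` holds one list of extracted organization names per post.
    """
    counts = Counter()
    for orgs in org_lists:
        # set() keeps one post that repeats a name from inflating it
        counts.update(set(clean_orgs(orgs)))
    return counts
```

For example, `count_orgs([["Tesla", "YOLO"], ["Tesla", "GME"]]).most_common(1)` returns the most frequently mentioned name together with its post count, which can then be mapped to a ticker for further analysis.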
Taught by
James Briggs