In this live coding session, we use the Python Reddit API Wrapper (PRAW) to retrieve data from subreddits on Reddit and perform sentiment analysis using pipelines from HuggingFace (🤗, the GitHub of Machine Learning), which are powered by transformers.
At the end of this session, you will be able to:
- Know how to work with APIs
- Feel more comfortable navigating through documentation, and even inspecting the source code
- Understand what a `pipeline` object is in HuggingFace
- Perform sentiment analysis using `pipeline`
- Run a Python script from the command line and get the results
- Examine the quality of data
- Understand data lineage
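As a rough preview of the workflow these objectives build toward, here is a minimal sketch that fetches post titles with PRAW and labels them with a HuggingFace `pipeline`. It is not the session's actual script: the helper `score_titles`, the environment variable names `REDDIT_CLIENT_ID` and `REDDIT_CLIENT_SECRET`, the user agent string, and the `learnpython` subreddit are all assumptions made for illustration.

```python
import os
from typing import Callable, Dict, List


def score_titles(titles: List[str], classifier: Callable) -> List[Dict]:
    """Pair each non-empty title with the label and score the classifier returns."""
    # Drop empty or whitespace-only titles first: a simple data-quality check.
    cleaned = [t.strip() for t in titles if t and t.strip()]
    if not cleaned:
        return []
    # A sentiment pipeline returns one {"label": ..., "score": ...} dict per input.
    results = classifier(cleaned)
    return [
        {"title": t, "label": r["label"], "score": r["score"]}
        for t, r in zip(cleaned, results)
    ]


def main() -> None:
    # Imported here so score_titles can be used without these packages installed.
    import praw
    from transformers import pipeline

    # Assumption: credentials are stored in these (hypothetical) environment variables.
    reddit = praw.Reddit(
        client_id=os.environ["REDDIT_CLIENT_ID"],
        client_secret=os.environ["REDDIT_CLIENT_SECRET"],
        user_agent="sentiment-demo (hypothetical user agent)",
    )
    # Assumption: "learnpython" is only an example subreddit.
    titles = [s.title for s in reddit.subreddit("learnpython").hot(limit=10)]

    # Downloads a default sentiment model the first time it runs.
    classifier = pipeline("sentiment-analysis")
    for row in score_titles(titles, classifier):
        print(f'{row["label"]:>8}  {row["score"]:.3f}  {row["title"]}')


# Run the live example only when credentials are configured.
if __name__ == "__main__" and "REDDIT_CLIENT_ID" in os.environ:
    main()
```

Saved under a name of your choosing, the file can be run from the command line with `python` followed by that file name. Keeping the original title next to each label is a small step toward data lineage: every result can be traced back to the post it came from.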
Create a new Conda environment for sentiment analysis (sa). If you already have this environment, continue to the next step.

    conda create -n sa python=3.8 jupyter -y

Activate your new environment

    conda activate sa

Open the jupyter-notebook

    jupyter-notebook

Navigate through the repo in the notebook to find imports.ipynb for this week and open it.
Run all of the cells in the notebook.
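If a cell fails, a quick check of which packages are importable in the active environment can help narrow down the cause. The sketch below is not taken from imports.ipynb; the helper `missing_packages` is hypothetical, and the list of required packages is an assumption based on the tools named in the introduction.

```python
import importlib.util


def missing_packages(names):
    """Return the top-level package names that cannot be imported here."""
    # find_spec looks a package up without actually importing it.
    return [name for name in names if importlib.util.find_spec(name) is None]


# Assumption: these are the packages the session relies on.
required = ["praw", "transformers"]
missing = missing_packages(required)
if missing:
    print("Missing packages:", ", ".join(missing))
else:
    print("All required packages are importable.")
```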
Please review the weekly narrative here
