balajisriraj/News-Media-Publisher-Analysis

News Media Publisher Analysis

This Git repository contains Python scripts and notebooks that analyze media publishers based on the articles they publish.

Note: This is only a draft copy.

Project Flow

  • Extract Google Search URLs for the user's search term
  • Extract news article data using the URLs from the metadata
  • Process and clean the text data
  • Analyze the data using typical NLP metrics
  • Use pre-trained models to predict the overall sentiment and the sentiment flow
  • Use pre-trained models to predict the overall emotion and the emotion flow
  • Apply clustering models to identify distinct clusters
  • Use the clusters as labels to train a supervised classification model
  • Use the model to identify feature importance
  • Use those features to differentiate the clusters and the media houses
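
As a rough illustration of the cleaning and sentiment steps, here is a minimal sketch of how an overall score and a per-sentence "flow" could be computed. It is not the repository's code. It assumes that "sentiment flow" means the sequence of sentence-level scores across an article, and it uses a tiny hand-made lexicon (`POSITIVE`, `NEGATIVE`) as a stand-in for a pre-trained model. All function names here are hypothetical.

```python
import re
from statistics import mean
from typing import Callable, List

# Hypothetical toy lexicon: a placeholder for a pre-trained sentiment model.
POSITIVE = {"good", "great", "gain", "win", "strong"}
NEGATIVE = {"bad", "loss", "fail", "weak", "crisis"}


def clean_text(raw: str) -> str:
    """Strip HTML tags and collapse runs of whitespace."""
    no_tags = re.sub(r"<[^>]+>", " ", raw)
    return re.sub(r"\s+", " ", no_tags).strip()


def split_sentences(text: str) -> List[str]:
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s for s in re.split(r"(?<=[.!?])\s+", text) if s]


def lexicon_score(sentence: str) -> float:
    """Score in [-1, 1]: (positive hits - negative hits) / total hits."""
    words = re.findall(r"[a-z']+", sentence.lower())
    pos = sum(w in POSITIVE for w in words)
    neg = sum(w in NEGATIVE for w in words)
    return 0.0 if pos + neg == 0 else (pos - neg) / (pos + neg)


def sentiment_flow(
    article: str, scorer: Callable[[str], float] = lexicon_score
) -> List[float]:
    """Return one score per sentence, in reading order."""
    return [scorer(s) for s in split_sentences(clean_text(article))]


def overall_sentiment(flow: List[float]) -> float:
    """Average the per-sentence scores; 0.0 for an empty article."""
    return mean(flow) if flow else 0.0


article = (
    "<p>The quarter was strong.</p> "
    "<p>Then came a crisis and a loss!</p> "
    "Analysts still see a great year."
)
flow = sentiment_flow(article)
print(flow)                     # [1.0, -1.0, 1.0]
print(overall_sentiment(flow))  # roughly 0.33
```

Because the scorer is passed in as a parameter, the toy lexicon could be swapped for any function that maps a sentence to a number, such as a wrapper around a pre-trained classifier. The same shape would apply to the emotion flow.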

Authors

Installation

Install the packages required to run the notebooks:

  pip install -r requirements.txt

🛠 Skills

Python, NLP, Text Processing, Transformers, Hugging Face, Web Scraping

Screenshots

Feature Importance, Feature Effects, Word Cloud
