Skip to content
#

article-extraction

Here are 2 public repositories matching this topic...

Language: All
Filter by language

GNewsScraper is a TypeScript package that scrapes article data from Google News based on a keyword or phrase. It returns the results as an array of JSON objects, making it convenient to access and use the scraped information

  • Updated Aug 19, 2023
  • TypeScript

This repository is part of an NLP course for humanities and cultural studies. This course uses historical newspapers as a source and applies NLP methods to them. NLP tasks: Tokenization, Lemmatization, TF-IDF, Part-of-speech tagging, semantic search with transformers, article extraction and OCR post-correction with LLMs, NER and text classification

  • Updated Dec 24, 2024
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the article-extraction topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the article-extraction topic, visit your repo's landing page and select "manage topics."

Learn more