Datasets and databases from News
News articles extracted from different sources and structured in CSV and JSON files. Large news datasets ideal for the NLP and machine learning.
Unleash the power of news with comprehensive datasets and databases! Dive into current events, track trends, and analyze public sentiment. Download free news archives or explore premium collections. Refine your research and gain valuable insights from a world of news information.
Use cases:
Machine translation
NPL
Fake news detection
Sentiment analysis
Al Jazeera latest news dataset till Jun 2021
Data extracted from the site aljazeera.com in
json
format and having more then 222 Thousand records
Euronews dataset in csv format
Data extracted from the site euronews.com in
CSV
format and having more then 27 Thousand records
Cointelegraph news dataset
Data extracted from the site cointelegraph.com in
csv
format and having more then 40 Thousand records
CNBC Economy Articles Dataset
Data extracted from the site cnbc.com in
CSV
format and having more then 17 Thousand records
The Japan Times news dataset May 2022
Data extracted from the site japantimes.co.jp in
CSV
format and having more then 76 Thousand records
Aljazeera news dataset 2022 feb
Data extracted from the site aljazeera.com in
csv
format and having more then 233 Thousand records
Time Magazine Latest News dataset
Data extracted from the site time.com in
JSON
format and having more then 160 Thousand records
Largest news articles dataset from CNBC
Data extracted from the site cnbc.com in
csv
format and having more then 480 Thousand records
Euro news datasets
Data extracted from the site euronews.com in
csv
format and having more then 15 Thousand records
Techcrunch news dataset
Data extracted from the site techcrunch.com in
CSV
format and having more then 150 Thousand records
CNN news dataset
Data extracted from the site edition.cnn.com in
json
format and having more then 27 Thousand records
Al Jazeera latest news dataset
Data extracted from the site aljazeera.com in
JSON
format and having more then 200 Thousand records
BBC Latest News Dataset 2021
Data extracted from the site bbc.co.uk in
json
format and having more then 1.17 Million records
BBC news dataset feb 2023
Data extracted from the site bbc.com in
CSV
format and having more then 1 Million records
Bloomberg Quint news dataset
Data extracted from the site bloombergquint.com in
json
format and having more then 400 Thousand records
News category dataset from huffpost
Data extracted from the site huffpost.com in
CSV
format and having more then 500 Thousand records
Fox News dataset is for analyzing media trends and narratives
Data extracted from the site foxnews.com in
CSV
format and having more then 1.4 Million records
The Japan times news dataset
Data extracted from the site japantimes.co.jp in
csv
format and having more then 75 Thousand records
Complete News Data Extracted from CNBC in JSON Format: Covering Business, Finance, Technology, and Global Trends for Europe, US, and UK Audiences
Data extracted from the site cnbc.com in
JSON
format and having more then 500 Thousand records
Aljazeera news datasets
Domain: aljazeera.com
There 3 datasets extracted from the aljazeera.com and data available in both JSON and CSV formats.
Euronews news datasets
Domain: euronews.com
There 2 datasets extracted from the euronews.com and data available in both JSON and CSV formats.
Cointelegraph news datasets
Domain: cointelegraph.com
There 1 datasets extracted from the cointelegraph.com and data available in both JSON and CSV formats.
Cnbc news datasets
Domain: cnbc.com
There 3 datasets extracted from the cnbc.com and data available in both JSON and CSV formats.
Japantimes news datasets
Domain: japantimes.co.jp
There 2 datasets extracted from the japantimes.co.jp and data available in both JSON and CSV formats.
Time news datasets
Domain: time.com
There 1 datasets extracted from the time.com and data available in both JSON and CSV formats.
Techcrunch news datasets
Domain: techcrunch.com
There 1 datasets extracted from the techcrunch.com and data available in both JSON and CSV formats.
Edition news datasets
Domain: edition.cnn.com
There 1 datasets extracted from the edition.cnn.com and data available in both JSON and CSV formats.
Bbc news datasets
Domain: bbc.co.uk
There 1 datasets extracted from the bbc.co.uk and data available in both JSON and CSV formats.
Bbc news datasets
Domain: bbc.com
There 1 datasets extracted from the bbc.com and data available in both JSON and CSV formats.
Bloombergquint news datasets
Domain: bloombergquint.com
There 1 datasets extracted from the bloombergquint.com and data available in both JSON and CSV formats.
Huffpost news datasets
Domain: huffpost.com
There 1 datasets extracted from the huffpost.com and data available in both JSON and CSV formats.
Foxnews news datasets
Domain: foxnews.com
There 1 datasets extracted from the foxnews.com and data available in both JSON and CSV formats.