Datasets and databases from News
News articles extracted from different sources and structured in CSV and JSON files. Large news datasets ideal for the NLP and machine learning.
Unleash the power of news with comprehensive datasets and databases! Dive into current events, track trends, and analyze public sentiment. Download free news archives or explore premium collections. Refine your research and gain valuable insights from a world of news information.
Use cases:
Machine translation
NPL
Fake news detection
Sentiment analysis
Cointelegraph news dataset
                                                        Data extracted from the site cointelegraph.com in 
csv
                                                         format and having more then 40 Thousand records 
                                                    
CNN news dataset
                                                        Data extracted from the site edition.cnn.com in 
json
                                                         format and having more then 27 Thousand records 
                                                    
The Japan Times news dataset May 2022
                                                        Data extracted from the site japantimes.co.jp in 
CSV
                                                         format and having more then 76 Thousand records 
                                                    
Fox News dataset is for analyzing media trends and narratives
                                                        Data extracted from the site foxnews.com in 
CSV
                                                         format and having more then 1.4 Million records 
                                                    
BBC News Dataset – February 2023 Edition
                                                        Data extracted from the site bbc.com in 
CSV
                                                         format and having more then 1 Million records 
                                                    
Largest news articles dataset from CNBC
                                                        Data extracted from the site cnbc.com in 
csv
                                                         format and having more then 480 Thousand records 
                                                    
The Japan times news dataset
                                                        Data extracted from the site japantimes.co.jp in 
csv
                                                         format and having more then 75 Thousand records 
                                                    
News category dataset from huffpost
                                                        Data extracted from the site huffpost.com in 
CSV
                                                         format and having more then 500 Thousand records 
                                                    
CNBC Economy Articles Dataset
                                                        Data extracted from the site cnbc.com in 
CSV
                                                         format and having more then 17 Thousand records 
                                                    
BBC Latest News Dataset 2021
                                                        Data extracted from the site bbc.co.uk in 
json
                                                         format and having more then 1.17 Million records 
                                                    
Euro news datasets
                                                        Data extracted from the site euronews.com in 
csv
                                                         format and having more then 15 Thousand records 
                                                    
Techcrunch news dataset
                                                        Data extracted from the site techcrunch.com in 
CSV
                                                         format and having more then 150 Thousand records 
                                                    
Al Jazeera latest news dataset
                                                        Data extracted from the site aljazeera.com in 
JSON
                                                         format and having more then 200 Thousand records 
                                                    
Euronews dataset in csv format
                                                        Data extracted from the site euronews.com in 
CSV
                                                         format and having more then 27 Thousand records 
                                                    
Aljazeera news dataset 2022 feb
                                                        Data extracted from the site aljazeera.com in 
csv
                                                         format and having more then 233 Thousand records 
                                                    
Bloomberg Quint news dataset
                                                        Data extracted from the site bloombergquint.com in 
json
                                                         format and having more then 400 Thousand records 
                                                    
Time Magazine Latest News dataset
                                                        Data extracted from the site time.com in 
JSON
                                                         format and having more then 160 Thousand records 
                                                    
Al Jazeera latest news dataset till Jun 2021
                                                        Data extracted from the site aljazeera.com in 
json
                                                         format and having more then 222 Thousand records 
                                                    
Complete News Data Extracted from CNBC in JSON Format: Covering Business, Finance, Technology, and Global Trends for Europe, US, and UK Audiences
                                                        Data extracted from the site cnbc.com in 
JSON
                                                         format and having more then 500 Thousand records 
                                                    
Cointelegraph news datasets
Domain: cointelegraph.com
There 1 datasets extracted from the cointelegraph.com and data available in both JSON and CSV formats.
Edition news datasets
Domain: edition.cnn.com
There 1 datasets extracted from the edition.cnn.com and data available in both JSON and CSV formats.
Japantimes news datasets
Domain: japantimes.co.jp
There 2 datasets extracted from the japantimes.co.jp and data available in both JSON and CSV formats.
Foxnews news datasets
Domain: foxnews.com
There 1 datasets extracted from the foxnews.com and data available in both JSON and CSV formats.
Bbc news datasets
Domain: bbc.com
There 1 datasets extracted from the bbc.com and data available in both JSON and CSV formats.
Cnbc news datasets
Domain: cnbc.com
There 3 datasets extracted from the cnbc.com and data available in both JSON and CSV formats.
Huffpost news datasets
Domain: huffpost.com
There 1 datasets extracted from the huffpost.com and data available in both JSON and CSV formats.
Bbc news datasets
Domain: bbc.co.uk
There 1 datasets extracted from the bbc.co.uk and data available in both JSON and CSV formats.
Euronews news datasets
Domain: euronews.com
There 2 datasets extracted from the euronews.com and data available in both JSON and CSV formats.
Techcrunch news datasets
Domain: techcrunch.com
There 1 datasets extracted from the techcrunch.com and data available in both JSON and CSV formats.
Aljazeera news datasets
Domain: aljazeera.com
There 3 datasets extracted from the aljazeera.com and data available in both JSON and CSV formats.
Bloombergquint news datasets
Domain: bloombergquint.com
There 1 datasets extracted from the bloombergquint.com and data available in both JSON and CSV formats.
Time news datasets
Domain: time.com
There 1 datasets extracted from the time.com and data available in both JSON and CSV formats.