News Edition json

CNN news dataset

Source: edition.cnn.com · Collected: July 2021 · Format: json

CrawlFeeds is not affiliated with, endorsed by, or sponsored by Edition. This dataset is independently collected from publicly available pages on edition.cnn.com. "Edition" is a registered trademark used here solely to describe the source of the data.

Records: 27 Thousand · Format: json · Last collected: July 2021

Description

This dataset contains over 27,000 news articles sourced from CNN (edition.cnn.com), including full content, metadata, and media fields. Each article carries publish and last-modified dates, author information, a short description, a header image, and both raw and cleaned content. It is suited to media research, sentiment analysis, topic modeling, and natural language processing (NLP) projects.

Last crawled in July 2021, this collection offers a historical snapshot of CNN’s reporting and editorial content.

Use Cases:

News content analysis
Fake news detection & bias tracking
Topic classification and clustering
Training AI/NLP models
Historical news trend research
Media monitoring tools
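
As a rough illustration of the historical trend use case, the sketch below counts articles per month from the published_at field. It assumes published_at is an ISO 8601-style string beginning with "YYYY-MM"; the dataset page does not state the date format, so check it against the real data first. The function name articles_per_month and the sample records are made up for this example.

```python
from collections import Counter


def articles_per_month(records):
    """Count records by the 'YYYY-MM' prefix of published_at.

    Assumption: published_at is an ISO 8601-style string. Records where the
    field is missing or has another shape are counted under 'unknown'.
    """
    counts = Counter()
    for record in records:
        value = record.get("published_at")
        looks_iso = (
            isinstance(value, str)
            and len(value) >= 7
            and value[:4].isdigit()
            and value[4] == "-"
            and value[5:7].isdigit()
        )
        counts[value[:7] if looks_iso else "unknown"] += 1
    return counts


# Invented records for illustration only; not taken from the dataset.
demo_records = [
    {"title": "A", "published_at": "2021-06-30T12:00:00Z"},
    {"title": "B", "published_at": "2021-07-01T08:30:00Z"},
    {"title": "C", "published_at": "2021-07-15T09:00:00Z"},
    {"title": "D"},
]

for month, count in sorted(articles_per_month(demo_records).items()):
    print(month, count)
```

Counting unparseable dates under a separate key, rather than dropping them, makes it easy to see how much of the collection the date assumption actually covers.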

Update Frequency:

Archived: no further updates are planned, so the data is best suited to snapshot-based analysis.

Data fields

title

url

published_at

last_modified_at

author

short_description

header_image

raw_content

content

crawled_at

_id

source
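
A minimal sketch of how records with these fields might be read and checked in Python. The field names come from the list above. Everything else is an assumption: the page does not say whether the file is a single JSON array or one JSON object per line, so the helper accepts either, and the names parse_records and missing_fields and the sample record are hypothetical.

```python
import json

# Field names as listed on the dataset page.
EXPECTED_FIELDS = [
    "title", "url", "published_at", "last_modified_at", "author",
    "short_description", "header_image", "raw_content", "content",
    "crawled_at", "_id", "source",
]


def parse_records(text):
    """Parse JSON text that is either one array of objects or JSON Lines.

    Assumption: the delivered file uses one of these two layouts.
    """
    stripped = text.strip()
    if not stripped:
        return []
    if stripped.startswith("["):
        return json.loads(stripped)
    # Otherwise treat each non-empty line as one JSON object.
    return [json.loads(line) for line in stripped.splitlines() if line.strip()]


def missing_fields(record):
    """Return the expected field names that are absent from a record."""
    return [name for name in EXPECTED_FIELDS if name not in record]


# Invented sample record for illustration only; not taken from the dataset.
sample = (
    '[{"title": "Example headline", '
    '"url": "https://edition.cnn.com/example", "_id": "abc123"}]'
)
records = parse_records(sample)
print(records[0]["title"])
print(missing_fields(records[0]))
```

To read a real file, pass its text to parse_records, for example the result of open(path, encoding="utf-8").read(), where path is wherever the download was saved.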
