How to scrape S
outh China Morning Post
with AgentQL

Looking for a better way to scrape South China Morning Post? Say goodbye to fragile XPath or DOM selectors that easily break with website updates. AI-powered AgentQL ensures consistent web scraping across various platforms, from South China Morning Post to any other website, regardless of UI changes.

Learn moreTry the playground, free! ->

Not just for scraping South China Morning Post

Smart selectors work anywhere

https://scmp.com

URL

Input any webpage.

{
  headlines[]
  datePublished
  author
  articleBody
}

Query

Describe data in natural language.

{
  "headlines": [
    "Hong Kong leader John Lee defends national security law",
    "China\u2019s factory activity shrinks for third month"
  ],
  "datePublished": "2024-04-26",
  "author": "South China Morning Post",
  "articleBody": "Article content here."
}

Returns

Receive accurate output in seconds.

How to use AgentQL on South China Morning Post

A dotted lineA blue lineA blue line
1

Install the SDK

Install code for JS and Python

npm install agentql

pip3 install agentql

2

Test and refine

Use the query debugger

3

Run your script

Install code for both JS and Python

agentql init

python example.py

More Websites to Scrape

Get started

Holds no opinions on what’s and how’s. Build whatever makes sense to you.