Pular para o conteúdo principal

SERPINFO

SERPINFO is the mechanism that allows uBlacklist to work on search engines such as Google, Bing, and DuckDuckGo. Each SERPINFO is a YAML file that describes how to locate search results on a specific search engine's results page, so that uBlacklist can block, highlight, or attach buttons to them.

uBlacklist ships with built-in SERPINFO definitions for the supported search engines. You can also subscribe to SERPINFO published by others, or write your own to support a search engine that is not yet covered.

The SERPINFO options page is opened from Other search engines on the extension's options page. There you can enable or disable individual SERPINFO, subscribe to ones published by others, and write your own under My SERPINFO.

File Format

SERPINFO files use YAML format with the following structure:

name: SearchEngineName
version: 1.0.0
homepage: https://example.com/
lastModified: 2023-09-01T00:00:00Z

pages:
- name: PageType
matches:
- https://search.example.com/search?*
results:
- root: .result-selector
url: a.link
props:
title: .title
commonProps:
$site: example
$category: web

Top-Level Properties

RequiredPropertyTypeDescription
namestringName of the SERPINFO
pagesarrayArray of page definitions
versionstringVersion of the SERPINFO
descriptionstringDescription of the SERPINFO
homepagestringHomepage URL of the SERPINFO
lastModifiedstringLast modification time in ISO format

Page Definition

Each item in the pages array represents a specific type of search result page (Web, Images, News, etc.):

RequiredPropertyTypeDescription
namestringName of the page definition
matchesarrayArray of match patterns to include pages
resultsarrayArray of result definitions
excludeMatchesarrayArray of match patterns to exclude pages
includeRegexstringRegular expression pattern to include pages
excludeRegexstringRegular expression pattern to exclude pages
userAgentstringCan be "any", "desktop", or "mobile"
commonPropsobjectCommon properties applied to all results on this page
delaynumber or booleanDelay in milliseconds after page load, or boolean to enable/disable delay

Result Definition

Each item in the results array defines how to extract data from a single search result:

RequiredPropertyTypeDescription
rootstring or arrayRoot command to locate result elements
urlstring or arrayProperty command to extract the URL from the result
namestringName of the result definition
propsobjectKey-value pairs where keys are property names and values are property commands for extraction
buttonarrayButton command to add block buttons to results
preserveSpacebooleanDetermines whether to retain the layout space of blocked results, preventing layout shifts

Commands

SERPINFO supports various commands for complex extraction scenarios, organized into three categories:

Root Commands

  • <css-selector> or [selector, <css-selector>]: Find elements matching the CSS selector
  • [upward, <level>, <root-command>]: Navigate up the DOM tree by <level> steps from each element found by the root command

Element Commands

  • <css-selector> or [selector, <css-selector>, <element-command>?]: Get element by CSS selector
  • [upward, <level>, <element-command>?]: Navigate up in the DOM tree by <level> steps

Note: When <element-command> is omitted in any of the above commands, the current root element is implicitly used.

Property Commands

  • <css-selector>: Get element's textContent property (default) or href attribute (when used in URL extraction) from element matching the CSS selector
  • [attribute, <name>, <element-command>?]: Get attribute value
  • [property, <name>, <element-command>?]: Get property value
  • [const, <value>]: Return a constant string
  • [domainToURL, <property-command>]: Convert a domain to a URL
  • [regexInclude, <pattern>, <property-command>]: Include only if matches regex
  • [regexExclude, <pattern>, <property-command>]: Exclude if matches regex
  • [regexSubstitute, <pattern>, <replacement>, <property-command>]: Replace with regex
  • [or, [<property-command>*], <element-command>?]: Try multiple commands in sequence

Note: When <element-command> is omitted in any of the above commands, the current root element is implicitly used.

Button Commands

  • [icon, <options>?, <element-command>?]: Add an icon button with a specific style. Options include:

    • style: Inline style (e.g., "top: 16px; right: 32px; --ub-icon-size: 16px;")
  • [text, <options>?, <element-command>?]: Add a text button with a specific position and style. Options include:

    • position: "beforebegin", "afterbegin", "beforeend", or "afterend" (default: "beforeend")
    • style: Inline style (e.g., "color: var(--fg-color); font-size: 14px;")
  • [inset, <options>?, <element-command>?]: Add a button at a specific position within the element. Options include:

    • top: CSS length or percentage
    • right: CSS length or percentage
    • bottom: CSS length or percentage
    • left: CSS length or percentage
    • zIndex: z-index value (default: 1)

Note: When <element-command> is omitted, the current root element is used.

Example

Here's an example for Bing Image Search:

name: Bing
version: 0.1.0
homepage: https://github.com/ublacklist/builtin#readme
license: MIT
lastModified: 2023-04-05T11:11:20Z

pages:
- name: Images (desktop)
matches:
- https://www.bing.com/images/search?*
userAgent: desktop
results:
- root: [upward, 1, ".iuscp"]
url:
- regexSubstitute
- '"purl":"([^"]+)'
- "\\1"
- [attribute, "m", ".iusc"]
props:
title: [attribute, "title", "li > a"]
button: [inset, { top: "32px", right: 0 }, ".iuscp"]
commonProps:
$site: bing
$category: images

Subscription

Publishing Your SERPINFO

You can share your SERPINFO definitions with others by hosting your YAML file and providing a subscription link.

  1. Create your SERPINFO YAML file following the format described above
  2. Host the file at a publicly accessible URL (e.g., GitHub, your own website)
  3. Share the subscription link with users using this format:
https://ublacklist.github.io/serpinfo/subscribe?url=<url-encoded-url>

Where <url-encoded-url> is your hosted YAML file's URL encoded with URL encoding.

Subscribing to SERPINFO

To subscribe to a SERPINFO definition:

  1. Click on a subscription link provided by the publisher
  2. The extension will open its options page and prompt you to add the subscription
  3. Confirm to add the SERPINFO to your extension
observação

To use subscription links, you need to enable them manually. Go to the extension's options page and turn on "Enable SERPINFO subscription links".