ICDST Search Engine Syntax Guide
The ICDST search engine offers advanced commands that narrow search results and support data mining. This guide explains the main commands, their syntax, and their usage. Available commands include inurl, intitle, indes, site, domain, subdomains, filetype, double quotes (" "), double double quotes ("" ""), date, and related. In each Scope line, the brackets [.....] mark the part of the URL or page source that the command matches against:
1- inurl: word or inurl:" some text "
Scope: https://[.....] or http://[.....]
This command restricts results to pages whose URLs contain the specified text.
EXAMPLE: inurl:about-us
This finds all web pages that include "about-us" in their URLs.
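As an illustration of the inurl scope, the following Python sketch checks whether a URL would qualify. It is not the engine's implementation: the function names are hypothetical, and case-insensitive matching is an assumption the guide does not confirm.

```python
from urllib.parse import urlsplit


def url_scope(url: str) -> str:
    """Return everything after the scheme, i.e. the bracketed scope of inurl."""
    parts = urlsplit(url)
    rest = parts.netloc + parts.path
    if parts.query:
        rest += "?" + parts.query
    return rest


def matches_inurl(url: str, text: str) -> bool:
    """True if `text` occurs in the URL after http:// or https://.

    Assumption: the comparison ignores case.
    """
    return text.lower() in url_scope(url).lower()
```

Under these assumptions, matches_inurl("https://example.com/about-us/team", "about-us") is true, while the scheme itself is never part of the match.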
2- intitle: word or intitle:" some text "
Scope: <title>[.....]</title>
This command filters results to include specified text in their title tags.
EXAMPLE: intitle:"introduction to"
This finds all webpages with title tags containing the phrase "introduction to".
3- indes: word or indes:" some text "
Scope: <meta name="description" content="[.....]"/>
This command filters results to include specified text in their description meta tags.
EXAMPLE: indes:"learn with"
This finds all web pages whose description meta tag has a content attribute containing "learn with".
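To make the intitle and indes scopes concrete, here is a minimal Python sketch that pulls both fields out of a page using only the standard library. The class and function names are hypothetical, and how the engine itself parses pages is not documented here.

```python
from html.parser import HTMLParser


class ScopeExtractor(HTMLParser):
    """Collects the <title> text and the description meta tag content."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.description = ""
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag == "meta":
            attr = dict(attrs)
            if (attr.get("name") or "").lower() == "description":
                self.description = attr.get("content") or ""

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data


def extract_scopes(html: str) -> tuple[str, str]:
    """Return (title, description) for one HTML document."""
    parser = ScopeExtractor()
    parser.feed(html)
    parser.close()
    return parser.title.strip(), parser.description.strip()
```

A query such as intitle:"introduction to" would then correspond to testing the first returned string for the phrase, and indes:"learn with" to testing the second.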
4- site: domain.com
Scope: httpx://www.domain.com/[.....] or httpx://domain.com/[.....]
This command restricts results to domain.com, covering both http and https (written httpx in this guide) and both the www and non-www hosts.
EXAMPLE: launch events site:nasa.gov
This finds web pages on httpx://www.nasa.gov and httpx://nasa.gov with information on "launch events".
5- domain: TLD
Scope: httpx://subdomain1.domain.[.....]/directory1/page1.html
This command restricts results to pages whose host name ends in the specified top-level domain (TLD).
EXAMPLE: how to write an article domain:edu
This searches web pages with a .edu TLD for the phrase "how to write an article".
6- subdomains: domain.com
Scope: httpx://[.....].domain.com/directory1/page1.html
This command includes all subdomains of domain.com, at any depth, in the search results. The site command, by contrast, covers only the www and non-www hosts (over http or https).
EXAMPLE: rocket engine subdomains:nasa.gov
This finds pages related to rocket engines on subdomains of nasa.gov at any depth (for example *.nasa.gov and *.*.nasa.gov).
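The site, domain, and subdomains commands all filter on the host name, so their differences are easiest to see side by side. The Python sketch below is only an approximation of the scopes described above. The function names are hypothetical, and it assumes that subdomains requires at least one label in front of the domain, which the guide does not state explicitly.

```python
from urllib.parse import urlsplit


def _host(url: str) -> str:
    """Lower-cased host name of a URL, or '' if it has none."""
    return (urlsplit(url).hostname or "").lower()


def matches_site(url: str, domain: str) -> bool:
    """site: only the bare domain and its www host, over http or https."""
    domain = domain.lower()
    return _host(url) in (domain, "www." + domain)


def matches_domain(url: str, tld: str) -> bool:
    """domain: any host whose last label is the given TLD."""
    return _host(url).endswith("." + tld.lower().lstrip("."))


def matches_subdomains(url: str, domain: str) -> bool:
    """subdomains: any host ending in .domain, at any depth.

    Assumption: the bare domain itself is not counted as a subdomain.
    """
    return _host(url).endswith("." + domain.lower())
```

With these definitions, a host such as mars.jpl.nasa.gov passes subdomains:nasa.gov and domain:gov but not site:nasa.gov.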
7- filetype: file_extension
Scope: httpx://subdomain1.domain.com/directory1/page1.[.....]
This command filters results to include only specified file extensions.
EXAMPLE: health and nutrition filetype:html
This shows HTML web pages related to health and nutrition.
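The filetype scope is the extension at the end of the URL path. A small Python sketch of that reading follows. The function names are hypothetical, and it assumes the match is made on the path alone, so a page served without an extension would not match even if the engine classifies it by content type.

```python
from pathlib import PurePosixPath
from urllib.parse import urlsplit


def url_extension(url: str) -> str:
    """Return the lower-cased file extension of the URL path, without the dot."""
    path = urlsplit(url).path
    return PurePosixPath(path).suffix.lstrip(".").lower()


def matches_filetype(url: str, extension: str) -> bool:
    """True if the URL path ends in the given extension (query string ignored)."""
    return url_extension(url) == extension.lower().lstrip(".")
```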
8- " some text here "
This command (double-quoted text) restricts results to those that contain exactly the quoted text.
EXAMPLE: "how to learn"
This finds indexed pages that contain the exact phrase "how to learn".
9- "" some text here ""
Unlike a double-quoted query, this command (double double-quoted text) also matches results in which other words or phrases appear between the quoted words.
EXAMPLE: ""The weather is today""
This finds results with "the weather is very hot today" or "the weather is so cold today" or any other results with missing words in between.
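One way to picture the difference from an exact-phrase query is a regular expression that keeps the quoted words in order but tolerates extra words between them. The Python sketch below is an assumption about the behavior, not the engine's algorithm. The function name and the max_gap limit are hypothetical, since the guide does not say how many intervening words are allowed.

```python
import re


def gapped_pattern(query: str, max_gap: int = 3) -> re.Pattern:
    """Compile a pattern matching the query words in order, with gaps allowed.

    max_gap is a hypothetical limit on extra words between two query words.
    """
    words = query.split()
    # Between two query words: up to max_gap other words, then a separator.
    gap = r"(?:\W+\w+){0,%d}?\W+" % max_gap
    body = gap.join(re.escape(word) for word in words)
    return re.compile(r"\b" + body + r"\b", re.IGNORECASE)
```

Under this sketch, the pattern for "The weather is today" matches both example sentences above as well as the unmodified phrase, but not text in which the words appear in a different order.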
10- date: num {s,m,h,d,w,o,y}
This command limits results to a timespan, where the units {s,m,h,d,w,o,y} stand for {seconds, minutes, hours, days, weeks, months, years}.
EXAMPLE: cats and mice date:5.5o
This shows results from the last 5.5 months, measured by the time each page was indexed.
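The date value can be read as a number followed by a one-letter unit, with o standing for months as in the example and w for weeks. The Python sketch below converts such a value to a timespan. The function name is hypothetical, and the 30-day month and 365-day year are assumptions, since the guide does not define the exact lengths.

```python
import re
from datetime import timedelta

# Seconds per unit letter. Month and year lengths are assumptions.
_UNIT_SECONDS = {
    "s": 1,
    "m": 60,
    "h": 3600,
    "d": 86400,
    "w": 7 * 86400,
    "o": 30 * 86400,   # assumed 30-day month
    "y": 365 * 86400,  # assumed 365-day year
}


def parse_date_filter(value: str) -> timedelta:
    """Convert a date: value such as '5.5o' or '2w' into a timedelta."""
    match = re.fullmatch(r"(\d+(?:\.\d+)?)([smhdwoy])", value.strip())
    if match is None:
        raise ValueError(f"not a valid date filter: {value!r}")
    amount, unit = float(match.group(1)), match.group(2)
    return timedelta(seconds=amount * _UNIT_SECONDS[unit])
```

A filter would then keep only pages whose index time is later than the current time minus the returned timespan.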
11- related: example.com or related:"example.com"
This command lists the content providers that share the most keywords with httpx://[www. or non-www]example.com.
EXAMPLE: related:archive.org
This lists the domains in the index that share the most keywords with archive.org.
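The examples in this guide follow one pattern: optional free text, then a command written as name:value, with the value in double quotes when it contains spaces. A small Python helper for composing such query strings might look like the sketch below. Both function names are hypothetical, and the helper only builds the text of a query; it does not contact the search engine.

```python
def format_command(name: str, value: str) -> str:
    """Render one command as name:value, quoting values that contain spaces."""
    value = value.strip()
    if any(ch.isspace() for ch in value):
        value = '"' + value + '"'
    return f"{name}:{value}"


def build_query(text: str, name: str, value: str) -> str:
    """Combine free text with a single command, as in the guide's examples."""
    return f"{text} {format_command(name, value)}".strip()
```

For instance, build_query("launch events", "site", "nasa.gov") reproduces the site example, and format_command("intitle", "introduction to") reproduces the quoted intitle example.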