The ICDST search engine provides a set of commands that let users search and mine data on this platform more precisely. This page describes the main commands with their correct syntax and usage. Available commands include inurl, intitle, indes, site, domain, subdomains, filetype, double quotes (""), date, and related. The scope of each command is represented by brackets []:
1- inurl:word or inurl:"some text"
scope: https://[.....] or http://[.....]
This command returns results that contain the specified text in their URLs.
EXAMPLE: inurl:about-us
This finds all websites that include "about-us" in their URLs.
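The scope line above can be read as "everything after the scheme". A minimal sketch of that reading follows; the function name inurl_matches is hypothetical, and the case-insensitive comparison is an assumption rather than documented ICDST behavior.

```python
def inurl_matches(url: str, text: str) -> bool:
    """Return True if `text` occurs in the part of `url` after the scheme."""
    # Drop "http://" or "https://" so only the bracketed scope is searched.
    _, separator, rest = url.partition("://")
    scope = rest if separator else url
    # Assumption: matching ignores letter case.
    return text.lower() in scope.lower()


print(inurl_matches("https://example.com/about-us", "about-us"))
print(inurl_matches("https://example.com/contact", "about-us"))
```

Under this reading, a query such as inurl:https would not match a URL merely because of its scheme.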
2- intitle:word or intitle:"some text"
scope: <title>[.....]</title>
This command returns results that contain the specified text in their title tags.
EXAMPLE: intitle:"introduction to"
This finds all web pages whose title tags include the phrase "introduction to".
3- indes:word or indes:"some text"
scope: <meta name="description" content="[.....]"/>
This command returns results that contain the specified text in their description meta tags.
EXAMPLE: indes:"learn with"
This finds all web pages in the index whose description meta tag has a content attribute containing "learn with".
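To make the intitle and indes scopes concrete, here is a minimal sketch that pulls both fields out of a page with Python's standard html.parser module. It only illustrates which part of a page each command looks at; the class and function names are hypothetical, and nothing here is claimed to be how ICDST builds its index.

```python
from html.parser import HTMLParser


class ScopeExtractor(HTMLParser):
    """Collects the <title> text and the description meta tag content."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.description = ""
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag == "meta":
            attributes = dict(attrs)
            # Only <meta name="description" ...> is in the indes scope.
            if (attributes.get("name") or "").lower() == "description":
                self.description = attributes.get("content") or ""

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data


def extract_scopes(html: str) -> tuple[str, str]:
    """Return (title text, description content) for one HTML document."""
    parser = ScopeExtractor()
    parser.feed(html)
    parser.close()
    return parser.title.strip(), parser.description.strip()


page = (
    "<html><head><title>Introduction to Python</title>"
    '<meta name="description" content="Learn with short examples"/>'
    "</head><body>Body text is outside both scopes.</body></html>"
)
title, description = extract_scopes(page)
print("introduction to" in title.lower())   # intitle:"introduction to"
print("learn with" in description.lower())  # indes:"learn with"
```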
4- site:domain.com
scope: httpx://www.domain.com/[.....] or httpx://domain.com/[.....]
This command returns results from both the http and https (httpx) versions and both the www and non-www versions of domain.com.
EXAMPLE: launch events site:nasa.gov
This finds web pages on the http and https, www and non-www versions of nasa.gov that match the keywords "launch events".
5- domain:TLD
scope: httpx://subdomain1.domain.[.....]/directory1/page1.html
This command returns only results whose domain part ends in the specified top-level domain (TLD).
EXAMPLE: how to write an article domain:edu
This searches web pages under the .edu TLD for the query "how to write an article".
6- subdomains:domain.com
scope: httpx://[.....].domain.com/directory1/page1.html
This command includes all subdomains (at any level) of the specified domain.com in the search results, as opposed to the site command, which includes only the http and https versions of the www and non-www hosts.
EXAMPLE: rocket engine subdomains:nasa.gov
This finds pages related to rocket engines in all subdomains of nasa.gov at any depth (for example, *.nasa.gov and *.*.nasa.gov).
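The site, domain, and subdomains commands differ only in which hostnames they accept. The sketch below restates the three scope lines as hostname checks using urllib.parse from the standard library. The function names are hypothetical, and one point is an assumption: following the scope line literally, subdomains requires at least one label before domain.com, so the bare domain is not counted.

```python
from urllib.parse import urlsplit


def _host(url: str) -> str:
    """Lower-cased hostname of a URL, or an empty string if it has none."""
    return (urlsplit(url).hostname or "").lower()


def site_matches(url: str, domain: str) -> bool:
    # site: accepts only the www and non-www hosts of the domain.
    domain = domain.lower()
    return _host(url) in (domain, "www." + domain)


def domain_matches(url: str, tld: str) -> bool:
    # domain: accepts any host whose last label is the given TLD.
    return _host(url).endswith("." + tld.lower().lstrip("."))


def subdomains_matches(url: str, domain: str) -> bool:
    # subdomains: accepts any host of the form [.....].domain.com.
    # Assumption: the bare domain itself is not included.
    return _host(url).endswith("." + domain.lower())


for url in ("https://www.nasa.gov/launch", "https://a.b.nasa.gov/engines"):
    print(url, site_matches(url, "nasa.gov"), subdomains_matches(url, "nasa.gov"))
```

Comparing whole hostname labels, rather than searching for the domain as a substring, keeps a host such as notnasa.gov from matching nasa.gov.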
7- filetype:file_extension
scope: https://subdomain1.domain.com/directory1/page1.[.....]
This command returns only results with the specified file extension.
EXAMPLE: health and nutrition filetype:html
This shows HTML web pages related to health and nutrition.
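The filetype scope is the extension at the end of the URL path. A minimal sketch of that check follows, using only the standard library; the function name filetype_matches is hypothetical, and ignoring the query string and letter case are assumptions.

```python
import posixpath
from urllib.parse import urlsplit


def filetype_matches(url: str, extension: str) -> bool:
    """Return True if the URL path ends with the given file extension."""
    # urlsplit separates the path from any query string or fragment.
    path = urlsplit(url).path
    _, found = posixpath.splitext(path)
    return found.lower() == "." + extension.lower().lstrip(".")


print(filetype_matches("https://subdomain1.domain.com/directory1/page1.html", "html"))
print(filetype_matches("https://example.com/report.PDF?download=1", "pdf"))
```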
8- "some text here"
This command (double-quoted text) returns only results that contain exactly the quoted text.
EXAMPLE: "how to learn"
This finds results in the index that contain the exact phrase "how to learn".
9- ""some text here""
This command (double double-quoted text), unlike a double-quoted query, finds results in which other words or phrases may appear between the quoted words.
EXAMPLE: ""The weather is today""
This finds results with "the weather is very hot today" or "the weather is so cold today" or any other results with missing words in between.
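One way to picture the difference between the two quoting styles is a regular expression that keeps the quoted words in order but tolerates extra words between them. The sketch below is only an illustration of that idea; the function name and the max_gap limit on the number of extra words are hypothetical, since this page does not say how large a gap ICDST accepts.

```python
import re


def gapped_phrase_pattern(phrase: str, max_gap: int = 3) -> re.Pattern:
    """Compile a pattern that matches the words of `phrase` in order,
    allowing up to `max_gap` extra words between neighbouring words."""
    words = [re.escape(word) for word in phrase.split()]
    # Zero to max_gap extra words, then the separator before the next word.
    gap = r"(?:\W+\w+){0,%d}?\W+" % max_gap
    return re.compile(r"\b" + gap.join(words) + r"\b", re.IGNORECASE)


pattern = gapped_phrase_pattern("The weather is today")
for text in (
    "the weather is very hot today",
    "the weather is so cold today",
    "today the weather is fine",
):
    print(text, "->", bool(pattern.search(text)))
```

With max_gap set to 0 the pattern accepts only the words in direct succession, which corresponds to an ordinary double-quoted query.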
10- date:num{s,m,h,d,w,o,y}
This command limits results to a timespan, where {s,m,h,d,w,o,y} represent {seconds, minutes, hours, days, weeks, months, years}.
EXAMPLE: cats and mice date:5.5o
This shows results from the last 5.5 months, based on index time.
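The date syntax can be read as a number followed by a one-letter unit. A minimal sketch of parsing it into a timespan is shown here; the function name is hypothetical, and treating a month as 30 days and a year as 365 days is an assumption, since the page does not define those lengths.

```python
import re
from datetime import timedelta

# Seconds per unit. Month and year lengths are assumptions (30 and 365 days).
UNIT_SECONDS = {
    "s": 1,
    "m": 60,
    "h": 3600,
    "d": 86400,
    "w": 7 * 86400,
    "o": 30 * 86400,
    "y": 365 * 86400,
}


def parse_date_operator(token: str) -> timedelta:
    """Convert a token such as 'date:5.5o' into a timedelta."""
    match = re.fullmatch(r"date:(\d+(?:\.\d+)?)([smhdwoy])", token)
    if match is None:
        raise ValueError(f"not a valid date operator: {token!r}")
    amount = float(match.group(1))
    return timedelta(seconds=amount * UNIT_SECONDS[match.group(2)])


print(parse_date_operator("date:5.5o"))
```

Note that the letter o stands for months because m is already taken by minutes.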
11- related:example.com or related:"example.com"
This command lists content providers that share the most keywords with httpx://[www. or non-www]example.com.
EXAMPLE: related:archive.org
This lists the domains in the index that share the most keywords with archive.org.
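As a rough illustration of "most common keywords", the sketch below ranks domains by how many keywords they share with a target domain. Everything in it is hypothetical: the function name, the toy keyword sets, and the choice of a plain overlap count as the score. The page does not describe how ICDST actually measures relatedness.

```python
def rank_related(target: str, keywords_by_domain: dict[str, set[str]]) -> list[str]:
    """Return the other domains ordered by number of keywords shared with `target`."""
    target_keywords = keywords_by_domain[target]
    scores = {}
    for domain, keywords in keywords_by_domain.items():
        shared = len(target_keywords & keywords)
        # Skip the target itself and domains with nothing in common.
        if domain != target and shared > 0:
            scores[domain] = shared
    # Highest overlap first; ties are broken alphabetically.
    return sorted(scores, key=lambda domain: (-scores[domain], domain))


# Toy data for illustration only.
toy_index = {
    "example.com": {"books", "archive", "history", "audio"},
    "example.org": {"books", "archive", "history"},
    "example.net": {"audio", "video"},
    "example.edu": {"sports"},
}
print(rank_related("example.com", toy_index))
```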