Vinícius Salgado
06/22/2023, 8:10 PMVinícius Salgado
06/22/2023, 8:11 PMJason Bosco
06/22/2023, 8:12 PMJason Bosco
06/22/2023, 8:12 PMVinícius Salgado
06/22/2023, 8:12 PMallowed_domains
to adjust the scope accordingly!
# If the scraped site is behind the CloudFlare Access.
CF_ACCESS_CLIENT_ID=
CF_ACCESS_CLIENT_SECRET=
# WARNING! Please be aware that the scraper sends auth headers to every scraped site, so use allowed_domains
to adjust the scope accordingly!
# If the scraped site is behind the Google Cloud Identity-Aware Proxy
IAP_AUTH_CLIENT_ID=
IAP_AUTH_SERVICE_ACCOUNT_JSON=
CHROMEDRIVER_PATH=./chrome-driver/chromedriverJason Bosco
06/22/2023, 8:13 PM8108
. Did you specifically intend to change it to 8107
?Vinícius Salgado
06/23/2023, 1:20 PMVinícius Salgado
06/23/2023, 1:21 PM{
"index_name": "sigma-calibration",
"start_urls": [
"<http://192.168.0.25:3000/>"
],
"sitemap_urls": [
"<http://192.168.0.25:3000/sitemap.xml>"
],
"stop_urls": [
"/tests"
],
"sitemap_alternate_links": true,
"selectors": {
"lvl0": {
"selector": "(//ul[contains(@class,'menu__list')]//a[contains(@class, 'menu__link menu__link--sublist menu__link--active')]/text() | //nav[contains(@class, 'navbar')]//a[contains(@class, 'navbar__link--active')]/text())[last()]",
"type": "xpath",
"global": true,
"default_value": "Documentation"
},
"lvl1": "article h1, header h1",
"lvl2": "article h2",
"lvl3": "article h3",
"lvl4": "article h4",
"lvl5": "article h5, article td:first-child",
"lvl6": "article h6",
"text": "article p, article li, article td:last-child"
},
"strip_chars": " .,;:#",
"custom_settings": {
"separatorsToIndex": "_",
"attributesForFaceting": [
"language",
"version",
"type",
"docusaurus_tag"
],
"attributesToRetrieve": [
"hierarchy",
"content",
"anchor",
"url",
"url_without_anchor",
"type"
]
},
"conversation_id": [
"833762294"
],
"nb_hits": 1
}
Vinícius Salgado
06/23/2023, 1:25 PMDEBUG:typesense.api_call:Making get /aliases/sigma-calibration
DEBUG:typesense.api_call:Try 1 to node 172.18.182.239:8108 -- healthy? True
DEBUG:urllib3.connectionpool:Starting new HTTP connection (1): 172.18.182.239:8108
DEBUG:urllib3.connectionpool:<http://172.18.182.239:8108> "GET /aliases/sigma-calibration HTTP/1.1" 200 None
DEBUG:typesense.api_call:172.18.182.239:8108 is healthy. Status code: 200
DEBUG:typesense.api_call:Making put /aliases/sigma-calibration
DEBUG:typesense.api_call:Try 1 to node 172.18.182.239:8108 -- healthy? True
DEBUG:urllib3.connectionpool:Starting new HTTP connection (1): 172.18.182.239:8108
DEBUG:urllib3.connectionpool:<http://172.18.182.239:8108> "PUT /aliases/sigma-calibration HTTP/1.1" 200 None
DEBUG:typesense.api_call:172.18.182.239:8108 is healthy. Status code: 200
DEBUG:typesense.api_call:Making delete /collections/sigma-calibration_1687525730
DEBUG:typesense.api_call:Try 1 to node 172.18.182.239:8108 -- healthy? True
DEBUG:urllib3.connectionpool:Starting new HTTP connection (1): 172.18.182.239:8108
DEBUG:urllib3.connectionpool:<http://172.18.182.239:8108> "DELETE /collections/sigma-calibration_1687525730 HTTP/1.1" 200 None
DEBUG:typesense.api_call:172.18.182.239:8108 is healthy. Status code: 200
Jason Bosco
06/23/2023, 7:18 PMJason Bosco
06/23/2023, 7:19 PMVinícius Salgado
06/23/2023, 7:39 PM