Rubai Mandal
03/13/2023, 7:04 AM<https://example.com>
instead of <http://host.docker.internal/>..
, it’s worked perfectly .
can anyone please help me out .
here is config file
{
"index_name": "payment-page",
"js_render": true,
"js_wait": 10,
"use_anchors": false,
"user_agent": "Custom Bot",
"start_urls": [
"<http://host.docker.internal/payment-page/android/base-sdk-integration/session>"
],
"sitemap_alternate_links": false,
"selectors": {
"lvl0": "h1, h2 , .heading-text",
"lvl1": "h3, .label",
"lvl2": ".key-header, .step-card-header-text, .th-row",
"text": ".screen2 p:not(:empty), .hero-welcome, .screen2 li, .main-screen, .only-steps p:not(:empty),td"
},
"strip_chars": " .,;:#",
"scrap_start_urls": true,
"custom_settings": {
"synonyms": {
"relevancy": [
"relevant",
"relevance"
],
"relevant": [
"relevancy",
"relevance"
],
"relevance": [
"relevancy",
"relevant"
]
}
}
}
output : > DocSearch: http://host.docker.internal/payment-page/android/base-sdk-integration/session 0 records)Rubai Mandal
03/13/2023, 10:14 AMKishore Nallan
03/13/2023, 10:20 AMJason Bosco
03/13/2023, 6:23 PMRubai Mandal
03/14/2023, 6:05 AM[
{
"index_name": "payment-page",
"js_render": true,
"js_wait": 5,
"use_anchors": false,
"user_agent": "Custom Bot",
"start_urls": [
"<https://9491-103-159-11-202.in.ngrok.io/payment-page/android/overview/integration-architecture>",
"<https://9491-103-159-11-202.in.ngrok.io/payment-page/android/overview/pre-requisites>",
"<https://9491-103-159-11-202.in.ngrok.io/payment-page/android/base-sdk-integration/session>",
"<https://9491-103-159-11-202.in.ngrok.io/payment-page/android/base-sdk-integration/order-status-api>",
"<https://9491-103-159-11-202.in.ngrok.io/payment-page/android/base-sdk-integration/getting-sdk>",
"<https://9491-103-159-11-202.in.ngrok.io/payment-page/android/base-sdk-integration/initiating-sdk>",
"<https://9491-103-159-11-202.in.ngrok.io/payment-page/android/base-sdk-integration/processing-sdk>",
"<https://9491-103-159-11-202.in.ngrok.io/payment-page/android/base-sdk-integration/handle-payment-response>",
"<https://9491-103-159-11-202.in.ngrok.io/payment-page/android/base-sdk-integration/life-cycle-events>",
"<https://9491-103-159-11-202.in.ngrok.io/payment-page/android/resources/error-codes>",
"<https://9491-103-159-11-202.in.ngrok.io/payment-page/android/resources/transaction-status>",
"<https://9491-103-159-11-202.in.ngrok.io/payment-page/android/resources/sample-payloads>"
],
"sitemap_alternate_links": false,
"selectors": {
"lvl0":"h1, h2 , .heading-text" ,
"lvl1": "h3, .label" ,
"lvl2": ".key-header, .step-card-header-text, .th-row",
"text":".screen2 p:not(:empty), .hero-welcome, .screen2 li, .main-screen, .only-steps p:not(:empty),td"
},
"strip_chars": " .,;:#",
"scrap_start_urls": true,
"custom_settings": {
"synonyms": {
"relevancy": ["relevant", "relevance"],
"relevant": ["relevancy", "relevance"],
"relevance": ["relevancy", "relevant"]
}
}
},
{
"index_name": "payment-page2",
"js_render": true,
"js_wait": 5,
"use_anchors": false,
"user_agent": "Custom Bot",
"start_urls": [
"<https://9491-103-159-11-202.in.ngrok.io/payment-page/android/overview/integration-architecture>",
"<https://9491-103-159-11-202.in.ngrok.io/payment-page/android/resources/transaction-status>",
"<https://9491-103-159-11-202.in.ngrok.io/payment-page/android/resources/sample-payloads>"
],
"sitemap_alternate_links": false,
"selectors": {
"lvl0":"h1, h2 , .heading-text" ,
"lvl1": "h3, .label" ,
"lvl2": ".key-header, .step-card-header-text, .th-row",
"text":".screen2 p:not(:empty), .hero-welcome, .screen2 li, .main-screen, .only-steps p:not(:empty),td"
},
"strip_chars": " .,;:#",
"scrap_start_urls": true,
"custom_settings": {
"synonyms": {
"relevancy": ["relevant", "relevance"],
"relevant": ["relevancy", "relevance"],
"relevance": ["relevancy", "relevant"]
}
}
}
]
Jason Bosco
03/14/2023, 2:03 PMRubai Mandal
03/14/2023, 2:26 PMJason Bosco
03/14/2023, 3:33 PMJason Bosco
03/14/2023, 3:34 PMRubai Mandal
03/14/2023, 6:31 PMJason Bosco
03/14/2023, 6:33 PMRubai Mandal
03/14/2023, 6:49 PMRubai Mandal
03/14/2023, 6:50 PMpayment-page
& payment-page2.
are two different productJason Bosco
03/14/2023, 11:39 PMRubai Mandal
03/15/2023, 5:57 AMpayment page
and in-app-upi
so , we want to filter on search result such that one product's search hits not comes in others product search result .
so how would we able to do that in a single config file .Jason Bosco
03/15/2023, 5:35 PMpayment page
docs pages, you’d add a meta tag to all pages called say:
<meta name="docsearch:product_tag" content="payment_page" />
In your in-app-upi
docs pages, you’d add a meta tag to all pages called say:
<meta name="docsearch:product_tag" content="in_app_upi" />
Then on the front-end, depending on the product docs the user is visiting right now, you would pass in a filter_by
parameter in the docsearch.js config like this:
https://typesense.org/docs/guide/docsearch.html#option-c-custom-docs-framework-with-docsearch-js-v3-modal-layout
See typesenseSearchParameters.filter_by
.
In your case you’d set that like this:
docsearch({
container: '#searchbar',
typesenseCollectionName: 'docs',
typesenseServerConfig: { ... },
typesenseSearchParameters: {
filter_by: 'product:=in_app_upi'
},
});
Rubai Mandal
03/16/2023, 6:25 AMRubai Mandal
03/16/2023, 10:39 AM.docsearch({
container: '#searchbar',
typesenseCollectionName: "Developer_Docs",
typesenseServerConfig: {
nodes: [{
host: 'localhost',
port: '8108',
protocol: 'http'
}],
apiKey: 'xyz',
},
typesenseSearchParameters: {
filter_by: `product:=${documentationJSON.documentation.productId}`
},
});
here is my docsearch.jsconfig . for me it's won't work ,it's fetching 0 result on search-bar .
can you please help me out what I am doing wrong here ?
here is my meta tag
<meta name="docsearch:product_tag" content="{$page.params.products}" />
Jason Bosco
03/16/2023, 4:22 PMJason Bosco
03/16/2023, 4:22 PMGET /collections
against your typesense node and post the output of that?Rubai Mandal
03/16/2023, 8:22 PMRubai Mandal
03/16/2023, 8:31 PMJason Bosco
03/16/2023, 8:35 PMfilter_by: `product:=${documentationJSON.documentation.productId}`
to
filter_by: `product_tag:=${documentationJSON.documentation.productId}`
and try again?Jason Bosco
03/16/2023, 8:36 PMRubai Mandal
03/16/2023, 8:41 PMRubai Mandal
03/17/2023, 6:49 AM{
"index_name": "Developer_Docs",
"js_render": true,
"js_wait": 5,
"use_anchors": false,
"user_agent": "Custom Bot",
"start_urls": [
"<https://62bc-119-82-122-182.in.ngrok.io/payment-page/ios/overview/integration-architecture>",
"<https://62bc-119-82-122-182.in.ngrok.io/payment-page/ios/overview/pre-requisites>",
"<https://62bc-119-82-122-182.in.ngrok.io/payment-page/ios/resources/error-codes>"
],
"sitemap_urls": [
"<https://62bc-119-82-122-182.in.ngrok.io/payment-page/android/sitemap.xml>"
],
"sitemap_alternate_links": true,
"selectors": {
"lvl0": "h1,h2,[data-search-class='lvl0']",
"lvl1": "h3,[data-search-class='lvl1']",
"lvl2": "[data-search-class='lvl2']",
"text": "p:not(:empty),[data-search-class='text']"
},
"strip_chars": " .,;:#",
"scrap_start_urls": true,
"custom_settings": {
"synonyms": {
"relevancy": [
"relevant",
"relevance"
],
"relevant": [
"relevancy",
"relevance"
],
"relevance": [
"relevancy",
"relevant"
]
}
}
}
Jason Bosco
03/17/2023, 4:36 PMRubai Mandal
03/17/2023, 6:45 PMRubai Mandal
03/17/2023, 6:46 PMsitemap_urls
not working for meRubai Mandal
03/18/2023, 8:52 PM{
"index_name": "Developer_Docs",
"js_render": true,
"js_wait": 5,
"use_anchors": false,
"user_agent": "Custom Bot",
"start_urls": [
"<https://docs.juspay.in/payment-page/android/overview/integration-architecture>"
],
"sitemap_urls": [
"<https://testing-chi-eight.vercel.app/sitemap.xml>"
],
"sitemap_alternate_links": true,
"selectors": {
"lvl0": "h1,h2,[data-search-class='lvl0']",
"lvl1": "h3,[data-search-class='lvl1']",
"lvl2": "[data-search-class='lvl2']",
"text": "p,[data-search-class='text']"
},
"strip_chars": " .,;:#",
"scrap_start_urls": true,
"custom_settings": {
"synonyms": {
"relevancy": [
"relevant",
"relevance"
],
"relevant": [
"relevancy",
"relevance"
],
"relevance": [
"relevancy",
"relevant"
]
}
}
}
Jason Bosco
03/18/2023, 9:01 PMstart_urls
, regardless of it being mentioned in the sitemap or not.
So for eg, in the above config, it will only crawl pages like this: <https://docs.juspay.in/payment-page/android/overview/integration-architecture/*>
Jason Bosco
03/18/2023, 9:02 PM<https://docs.juspay.in/payment-page>
in the start_urls
since that seems to be your base url?Rubai Mandal
03/18/2023, 9:07 PMJason Bosco
03/18/2023, 9:08 PMRubai Mandal
03/18/2023, 9:10 PMRubai Mandal
03/18/2023, 9:10 PMDEBUG:scrapy.dupefilters:Filtered duplicate request: <GET <https://docs.juspay.in/payment-page>> - no more duplicates will be shown (see DUPEFILTER_DEBUG to show all duplicates)
I got this