Apoorv Tiwari
08/17/2021, 4:48 AM
Kishore Nallan
08/17/2021, 4:56 AM
Apoorv Tiwari
08/17/2021, 6:00 AM
Jason Bosco
08/17/2021, 6:14 AM
Apoorv Tiwari
08/17/2021, 8:25 AM
Jason Bosco
08/18/2021, 2:34 AM
Apoorv Tiwari
08/18/2021, 3:04 AM
Jason Bosco
08/25/2021, 6:53 AM
Apoorv Tiwari
08/26/2021, 6:54 AM
Jason Bosco
08/26/2021, 7:38 AM
Jason Bosco
08/26/2021, 7:39 AM
Apoorv Tiwari
08/26/2021, 7:40 AM
Jason Bosco
08/26/2021, 11:55 PM
[...] `start_urls` to be the base URL for all pages in the documentation. So if you set it to <https://docs.tooljet.io/docs/intro>, it expects all pages in your documentation to have that as the base URL.
You'd ideally want to set the base URL (`start_urls`) to `["https://docs.tooljet.io/docs/"]`.
But then if you visit "https://docs.tooljet.io/docs/" it does an infinite redirect. If you can fix that and then update `start_urls`, I think it should work after that...
Apoorv Tiwari
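To make the base-URL point in the message above concrete, here is a rough sketch of a prefix check. It assumes the scraper treats each entry in `start_urls` as a simple URL prefix, which is a simplification and not confirmed by the thread; `is_under_start_urls` and the sample page URL are hypothetical.

```python
def is_under_start_urls(page_url, start_urls):
    """Return True if page_url begins with any of the start URLs."""
    return any(page_url.startswith(base) for base in start_urls)


# Hypothetical page URL, used only for illustration.
page = "https://docs.tooljet.io/docs/some-page"

# With the intro page as the base, a sibling page is not covered.
print(is_under_start_urls(page, ["https://docs.tooljet.io/docs/intro"]))  # False

# With the docs root as the base, it is.
print(is_under_start_urls(page, ["https://docs.tooljet.io/docs/"]))  # True
```

Under that assumption, pointing `start_urls` at the docs root is what brings the rest of the pages into scope.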
08/27/2021, 3:07 AM
Apoorv Tiwari
08/27/2021, 3:20 AM
Jason Bosco
08/27/2021, 3:22 AM
Apoorv Tiwari
08/27/2021, 3:22 AM
Jason Bosco
08/27/2021, 3:24 AM
[...] `ifconfig` and use the 192.168.x.x IP or the 10.x.x.x IP from there
Jason Bosco
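The "pick the 192.168.x.x or 10.x.x.x address" step from the message above could be sketched like this. The function name `pick_lan_ip` and the sample addresses are made up for illustration; in practice the candidates would be whatever `ifconfig` lists on your machine.

```python
import ipaddress

# Private ranges mentioned in the message: 192.168.x.x and 10.x.x.x.
LAN_NETWORKS = [
    ipaddress.ip_network("192.168.0.0/16"),
    ipaddress.ip_network("10.0.0.0/8"),
]


def pick_lan_ip(addresses):
    """Return the first address in 192.168.0.0/16 or 10.0.0.0/8, else None."""
    for raw in addresses:
        ip = ipaddress.ip_address(raw)
        if any(ip in net for net in LAN_NETWORKS):
            return raw
    return None


# Made-up addresses of the kind ifconfig might list.
print(pick_lan_ip(["127.0.0.1", "192.168.1.23"]))  # 192.168.1.23
```

The loopback address is skipped because it falls in neither range.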
08/27/2021, 3:25 AM
Apoorv Tiwari
08/27/2021, 4:42 AM
Jason Bosco
08/27/2021, 4:50 AM
Apoorv Tiwari
08/27/2021, 4:51 AM
Apoorv Tiwari
08/27/2021, 4:52 AM
Jason Bosco
08/27/2021, 4:53 AM
Apoorv Tiwari
08/27/2021, 4:54 AM
Jason Bosco
08/27/2021, 4:55 AM
Jason Bosco
08/27/2021, 4:55 AM
Apoorv Tiwari
08/27/2021, 5:44 AM
Jason Bosco
08/27/2021, 5:45 AM
Jason Bosco
08/27/2021, 5:45 AM
Jason Bosco
08/27/2021, 5:47 AM
Apoorv Tiwari
08/27/2021, 5:52 AM
Apoorv Tiwari
08/27/2021, 6:10 AM
Jason Bosco
08/27/2021, 6:28 AM
Apoorv Tiwari
08/27/2021, 8:17 AM
Apoorv Tiwari
08/30/2021, 5:41 AM
Jason Bosco
08/30/2021, 6:13 AM
Apoorv Tiwari
08/30/2021, 6:14 AM
Jason Bosco
08/30/2021, 6:15 AM
Apoorv Tiwari
08/30/2021, 6:16 AM
Apoorv Tiwari
08/30/2021, 6:16 AM
Apoorv Tiwari
08/30/2021, 6:17 AM
Jason Bosco
08/30/2021, 6:18 AM
Apoorv Tiwari
08/30/2021, 6:20 AM
Jason Bosco
08/30/2021, 6:32 AM
Apoorv Tiwari
08/30/2021, 6:38 AM
Jason Bosco
08/31/2021, 7:51 AM
[...] `\00` it works fine!! In fact, since you have `display: none` set further down, you can just remove those two lines, and it still hides the dark/light mode icons if that's your intention.
Also, you want to set `start_urls` as `["https://docs.tooljet.io/docs/", "https://docs.tooljet.io/docs/intro/"]`.
Finally, if you're trying this on localhost, you want to use ngrok even when running locally, since the scraper apparently doesn't work with port numbers.
Apoorv Tiwari
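As a small sketch of the last two points in the message above, the snippet below builds a config fragment with the suggested `start_urls` and flags any entry that carries an explicit port. The `index_name` value and the helper `has_explicit_port` are hypothetical; only the two URLs come from the thread, and a real scraper config would need more fields than shown here.

```python
import json
from urllib.parse import urlsplit


def has_explicit_port(url):
    """Return True if the URL names a port, e.g. http://localhost:3000/."""
    return urlsplit(url).port is not None


config = {
    "index_name": "tooljet-docs",  # hypothetical name, not from the thread
    "start_urls": [
        "https://docs.tooljet.io/docs/",
        "https://docs.tooljet.io/docs/intro/",
    ],
}

# Flag any start URL that carries a port number before running the scraper.
with_ports = [u for u in config["start_urls"] if has_explicit_port(u)]

print(json.dumps(config, indent=2))
print(with_ports)  # []
```

A local URL such as `http://localhost:3000/docs/` would be flagged by this check, which is the case where tunnelling through ngrok was suggested.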
08/31/2021, 5:27 PM
Jason Bosco
08/31/2021, 5:30 PM
Apoorv Tiwari
08/31/2021, 5:30 PM
Jason Bosco
08/31/2021, 5:32 PM
Jason Bosco
08/31/2021, 5:32 PM
Apoorv Tiwari
08/31/2021, 5:32 PM
Jason Bosco
08/31/2021, 5:33 PM
Jason Bosco
08/31/2021, 5:33 PM
Jason Bosco
08/31/2021, 5:33 PM
Apoorv Tiwari
08/31/2021, 5:34 PM
Apoorv Tiwari
08/31/2021, 5:39 PM
Jason Bosco
08/31/2021, 5:41 PM
Jason Bosco
08/31/2021, 5:42 PM
Apoorv Tiwari
08/31/2021, 5:52 PM
Jason Bosco
08/31/2021, 6:12 PM
Apoorv Tiwari
08/31/2021, 6:18 PM
Jason Bosco
08/31/2021, 6:19 PM
Apoorv Tiwari
08/31/2021, 6:24 PM
Jason Bosco
08/31/2021, 6:25 PM
Jason Bosco
09/01/2021, 3:08 AM
[...] `yarn build` and then `yarn serve` and then run the scraper. When you just run `yarn start` the content is client-side rendered via JS, so the scraper doesn't pick it up. When you run `yarn build` the site gets statically built with the full HTML and then `yarn serve` just serves the build directory.
Jason Bosco
09/03/2021, 5:32 AM
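The `yarn start` versus `yarn build` difference described in the 09/01 message can be illustrated with a rough sketch of what a crawler that does not execute JavaScript would see. Both HTML snippets are invented for illustration, and `visible_text` is a hypothetical helper, not part of the scraper.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collect text that is present in the HTML itself, ignoring scripts."""

    def __init__(self):
        super().__init__()
        self.chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text outside <script> and <style>.
        if not self._skip_depth and data.strip():
            self.chunks.append(data.strip())


def visible_text(html):
    """Return the text a non-JS crawler could read from the raw HTML."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


# Invented sample pages, for illustration only.
client_rendered = '<div id="root"></div><script src="/bundle.js"></script>'
static_build = '<div id="root"><h1>Intro</h1><p>Getting started</p></div>'

print(repr(visible_text(client_rendered)))  # ''
print(repr(visible_text(static_build)))     # 'Intro Getting started'
```

The first page has nothing to index until its script runs in a browser, while the second already carries the content in the markup.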