Hello I am transitioning my Docusaurus website from Algolia typesense #community-help

Hello. I am transitioning my Docusaurus website fr...

James Nesta

02/08/2023, 9:32 PM

Hello. I am transitioning my Docusaurus website from Algolia --> Typesense: https://isaacscript.github.io/ For my crawler, I am using the default config that is recommended in the official Typesense documentation: https://github.com/algolia/docsearch-configs/blob/master/configs/docusaurus-2.json My search is currently "working" insofar that some things appear to be searchable. However, it seems that the crawler did not index some of the words in level 3 headers. For example: https://isaacscript.github.io/isaacscript-common/other/enums/StatType#tear_falling_acceleration Searching for

TEAR_FALLING_ACCELERATION

results in:

No results for "TEAR_FALLING_ACCELERATION"

Is there something else that I forgot to do for a Docusaurus website?

Jason Bosco

02/08/2023, 10:50 PM

It seems to work if I remove the underscores during search…

Jason Bosco

02/08/2023, 10:51 PM

Also notice how the field is indexed by the scraper… it has space before and after the underscore in

hierarcy.lvl3

Jason Bosco

02/08/2023, 10:51 PM

Somehow the generated markup seems to have this issue…

Jason Bosco

02/08/2023, 10:52 PM

So long story short, there is actually no exact match for

TEAR_FALLING_ACCELERATION

in the ~~index~~ collection

James Nesta

02/08/2023, 10:53 PM

Oh, that's interesting. I wonder why Algolia is able to index it properly though.

Jason Bosco

02/08/2023, 10:54 PM

Could you try adding

split_join_tokens: true

themeConfig.typesense.typesenseSearchParameters

to see if that helps?

Jason Bosco

02/08/2023, 10:55 PM

Correction:

split_join_tokens: always

James Nesta

02/08/2023, 11:24 PM

Thanks Jason. I did that, and it didn't seem to change anything. I still have 0 results for "TEAR_FALLING_ACCELERATION".

Jason Bosco

02/08/2023, 11:26 PM

Another thing to try is to set

symbols_to_index

in the collection schema, and set it to

["_"]

Jason Bosco

02/08/2023, 11:26 PM

Documented in this table: https://typesense.org/docs/0.24.0/api/collections.html#schema-parameters

Jason Bosco

02/08/2023, 11:27 PM

The scraper currently uses a fixed schema, so you would have to fork the scraper and change the schema here: https://github.com/typesense/typesense-docsearch-scraper/blob/a005d7a8bbd45bd71fd3895024f05663e9f797c6/scraper/src/typesense_helper.py#L36-L53

Jason Bosco

02/08/2023, 11:28 PM

So it should be something like:

Copy code

'name': self.collection_name_tmp,
'symbols_to_index': '_',
'fields': [...]
...

James Nesta

02/08/2023, 11:59 PM

I filed a ticket with Docusaurus, and they say that this is Typesense's fault: https://github.com/facebook/docusaurus/issues/8645#issuecomment-1423386178 I'll work on forking the plugin now, thank you.

🤔 1

James Nesta

02/09/2023, 12:41 AM

I consider myself to be an expert at Linux systems, but I wasn't able to follow the instructions here: https://github.com/typesense/typesense-docsearch-scraper#releasing-a-new-version From a fresh Ubuntu 22 server, I ran into several roadblocks, the last of which is dotenv related, which I presume should automatically be handled by the pipenv environment. Can we update the README file with more detailed instructions of exactly what to install/run from a fresh Ubuntu 22, step by step? I understand not wanting to add detailed documentation for every Linux distribution, but I feel like at the very least the thing should be able to be built from on Ubuntu.

Jason Bosco

02/09/2023, 12:42 AM

Hmm, I follow those exact instructions any time we need to publish a new version of the scraper. I do run it from macOS though. May I know what errors you ran into?

James Nesta

02/09/2023, 12:46 AM

• The first error was running pipenv after installing it from apt, which resulted in a bunch of errors. • After some googling, I saw it was recommended to install pipenv from pypi instead, so I apt removed it and tried that:

Copy code

pip install --user pipenv

• That worked, but then I got an error about Python 3.6 not being installed. • In order to install Python 3.6, I determined that I needed to install pyenv, so I ran:

Copy code

sudo apt-get update; sudo apt-get install make build-essential libssl-dev zlib1g-dev libbz2-dev libreadline-dev libsqlite3-dev wget curl llvm libncursesw5-dev xz-utils tk-dev libxml2-dev libxmlsec1-dev libffi-dev liblzma-dev

And then:

Copy code

curl <https://pyenv.run> | bash

Then I was able to do:

Copy code

pyenv install 3.6.3

But then I got an error when creating the env:

Copy code

james@2wsx:~/typesense-docsearch-scraper$ pipenv shell
Creating a virtualenv for this project...
Pipfile: /home/james/typesense-docsearch-scraper/Pipfile
Using /home/james/.pyenv/versions/3.6.3/bin/python3.6m (3.6.3) to create virtualenv...
⠧ Creating virtual environment...fail
Traceback (most recent call last):
  File "/usr/lib/python3/dist-packages/virtualenv/seed/embed/via_app_data/via_app_data.py", line 84, in _get
    result = get_wheel(
  File "/usr/lib/python3/dist-packages/virtualenv/seed/wheels/acquire.py", line 26, in get_wheel
    wheel = from_bundle(distribution, version, for_py_version, search_dirs, app_data, do_periodic_update, env)
  File "/usr/lib/python3/dist-packages/virtualenv/seed/wheels/bundle.py", line 13, in from_bundle
    wheel = load_embed_wheel(app_data, distribution, for_py_version, of_version)
  File "/usr/lib/python3/dist-packages/virtualenv/seed/wheels/bundle.py", line 33, in load_embed_wheel
    wheel = get_embed_wheel(distribution, for_py_version)
  File "/usr/lib/python3/dist-packages/virtualenv/seed/wheels/embed/__init__.py", line 77, in get_embed_wheel
    raise Exception((
Exception: Wheel for pip for Python 3.6 is unavailable. apt install python3-pip-whl
created virtual environment CPython3.6.3.final.0-64 in 1417ms
  creator CPython3Posix(dest=/home/james/.local/share/virtualenvs/typesense-docsearch-scraper-fJqFon_Y, clear=False, no_vcs_ignore=False, global=False)
  seeder FromAppData(download=False, pip=bundle, setuptools=bundle, wheel=bundle, via=copy, app_data_dir=/home/james/.local/share/virtualenv)
    added seed packages: pip==21.3.1, setuptools==59.6.0, wheel==0.37.1
  activators BashActivator,CShellActivator,FishActivator,NushellActivator,PowerShellActivator,PythonActivator

✔ Successfully created virtual environment!
Virtualenv location: /home/james/.local/share/virtualenvs/typesense-docsearch-scraper-fJqFon_Y
Launching subshell in virtual environment...
 . /home/james/.local/share/virtualenvs/typesense-docsearch-scraper-fJqFon_Y/bin/activate

I proceeded anyway, and crossed my fingers, but the next command also fails:

Copy code

james@2wsx:~/typesense-docsearch-scraper$ pipenv shell
Launching subshell in virtual environment...
 . /home/james/.local/share/virtualenvs/typesense-docsearch-scraper-fJqFon_Y/bin/activate
james@2wsx:~/typesense-docsearch-scraper$  . /home/james/.local/share/virtualenvs/typesense-docsearch-scraper-fJqFon_Y/bin/activate
(typesense-docsearch-scraper) james@2wsx:~/typesense-docsearch-scraper$
(typesense-docsearch-scraper) james@2wsx:~/typesense-docsearch-scraper$
(typesense-docsearch-scraper) james@2wsx:~/typesense-docsearch-scraper$ ./docsearch docker:build
Traceback (most recent call last):
  File "./docsearch", line 3, in <module>
    from cli.src.index import run
  File "/home/james/typesense-docsearch-scraper/cli/src/index.py", line 3, in <module>
    from dotenv import load_dotenv
ModuleNotFoundError: No module named 'dotenv'
(typesense-docsearch-scraper) james@2wsx:~/typesense-docsearch-scraper$ pip install dotenv
Collecting dotenv
  Downloading dotenv-0.0.5.tar.gz (2.4 kB)
  Preparing metadata (setup.py) ... error
  ERROR: Command errored out with exit status -11:
   command: /home/james/.local/share/virtualenvs/typesense-docsearch-scraper-fJqFon_Y/bin/python -c 'import io, os, sys, setuptools, tokenize; sys.argv[0] = '"'"'/tmp/pip-install-apjacqc9/dotenv_ddf8c6da6ef9463bb8285aad152279ed/setup.py'"'"'; __file__='"'"'/tmp/pip-install-apjacqc9/dotenv_ddf8c6da6ef9463bb8285aad152279ed/setup.py'"'"';f = getattr(tokenize, '"'"'open'"'"', open)(__file__) if os.path.exists(__file__) else io.StringIO('"'"'from setuptools import setup; setup()'"'"');code = f.read().replace('"'"'\r\n'"'"', '"'"'\n'"'"');f.close();exec(compile(code, __file__, '"'"'exec'"'"'))' egg_info --egg-base /tmp/pip-pip-egg-info-lxvlxqvp
       cwd: /tmp/pip-install-apjacqc9/dotenv_ddf8c6da6ef9463bb8285aad152279ed/
  Complete output (0 lines):
  ----------------------------------------
WARNING: Discarding <https://files.pythonhosted.org/packages/e2/46/3754073706e31670eed18bfa8a879305b56a471db15f20523c2427b10078/dotenv-0.0.5.tar.gz#sha256=b58d2ab3f83dbd4f8a362b21158a606bee87317a9444485566b3c8f0af847091> (from <https://pypi.org/simple/dotenv/>). Command errored out with exit status -11: python setup.py egg_info Check the logs for full command output.

Jason Bosco

02/09/2023, 12:50 AM

Could you run

pip install

? And then try again?

James Nesta

02/09/2023, 12:51 AM

Inside the pipenv shell, or inside the normal shell?

Jason Bosco

02/09/2023, 12:51 AM

normal shell

James Nesta

02/09/2023, 12:51 AM

pip isn't on Ubuntu by default, because it wants you to use either pip2 or pip3.

Jason Bosco

02/09/2023, 12:51 AM

Oh wait, it’s

pipenv install

James Nesta

02/09/2023, 12:52 AM

Fails: https://pastebin.com/e5pLZ6Hu

Jason Bosco

02/09/2023, 12:59 AM

Hmm, this is unfortunately beyond my python tooling knowledge… to be able to help.

Jason Bosco

02/09/2023, 12:59 AM

I vaguely remember running into a bunch of issues with pipenv (different ones than the ones you shared above) and finally stumbled my way around to getting it to work on my local machine.

Jason Bosco

02/09/2023, 1:00 AM

So it seems like there are many confounding variables on what could be going wrong with the python > pip > pipenv environment

James Nesta

02/09/2023, 3:58 PM

Jason, I tried on MacOS and ran into the exact same issue. Can you update the README file with instructions for how to do it from a fresh Mac or a fresh Linux with nothing else installed?

Jason Bosco

02/09/2023, 4:40 PM

I recently upgraded my mac, so this is an almost brand new OS installation as far as docsearch-scraper is concerned, but I did already have pyenv installed. This is what I did and it worked for me:

Copy code

brew install pyenv
pyenv install 3.6
pyenv local 3.6
pip install --upgrade pip
pip install --user pipenv
pipenv install
pipenv shell

James Nesta

02/09/2023, 7:58 PM

I must insist, I believe that the errors are because of a bogus Pipfile.lock. If you remove the Pipfile.lock, and then do

pipenv install

, you will get a bunch of errors about dependencies not being able to be resolved, so this definitely seems like a problem with the repository itself.

James Nesta

02/09/2023, 8:04 PM

Looks like the relevant error message is:

Copy code

[pipenv.exceptions.ResolutionFailure]: Warning: Your dependencies could not be resolved. You likely have a mismatch in your sub-dependencies.
  You can use $ pipenv install --skip-lock to bypass this mechanism, then run $ pipenv graph to inspect the situation.
  Hint: try $ pipenv lock --pre if it is a pre-release dependency.
ERROR: No matching distribution found for slacker==0.9.60

Jason Bosco

02/09/2023, 8:08 PM

That version exists though: https://pypi.org/project/slacker/0.9.60/

Jason Bosco

02/09/2023, 8:08 PM

And it installed on my machine

Jason Bosco

02/09/2023, 8:08 PM

Could you make sure you’re running python 3.6 and not the default 2.7 that comes installed on macOS?

James Nesta

02/09/2023, 8:10 PM

I switched back to Ubuntu:

Copy code

james@2wsx:~/typesense-docsearch-scraper$ python --version
Python 3.6.15

James Nesta

02/09/2023, 8:10 PM

For more information, see this pull request: https://github.com/typesense/typesense-docsearch-scraper/pull/23/files

James Nesta

02/09/2023, 8:10 PM

You can follow these exact instructions step by step, which should be able to reproduce the problem.

Jason Bosco

02/09/2023, 9:49 PM

I was able to replicate this on a brand new Ubuntu 22 machine

Jason Bosco

02/09/2023, 9:49 PM

Researching why this is happening

James Nesta

02/09/2023, 9:50 PM

It happens on macOS too, for what it is worth.

Jason Bosco

02/09/2023, 11:16 PM

I just spent a couple of hours on this, and I somehow got it to work, but I don’t know which sequence of steps made it work 😢

James Nesta

02/09/2023, 11:17 PM

Well, we need to find out those steps and edit my PR accordingly.

Jason Bosco

02/09/2023, 11:17 PM

Going to copy my bash history, and then try again on a new machine

James Nesta

02/09/2023, 11:17 PM

When I was testing, I found it useful to use snapshots in VirtualBox.

Jason Bosco

02/09/2023, 11:18 PM

No virtualbox on M1 sadly

James Nesta

02/09/2023, 11:18 PM

For example, I did a snapshot after a fresh install, and then I did another snapshot after the

pyenv local 3.6

command, since that takes a particularly long time.

James Nesta

02/09/2023, 11:19 PM

Back when I used macOS for work I used VMWare Fusion a lot.

James Nesta

02/09/2023, 11:19 PM

Is that updated for M1?

Jason Bosco

02/09/2023, 11:19 PM

Haven’t used VMWare Fusion… But I heard Virtual Box is planning to support M1, with no ETA

James Nesta

02/09/2023, 11:20 PM

Ah, looks like it does: https://www.techzine.eu/news/applications/84801/vmware-fusion-now-supports-windows-on-apple-m1-processors/

James Nesta

02/09/2023, 11:21 PM

I guess Parallels is another option, although I think Fusion is much better.

Jason Bosco

02/09/2023, 11:22 PM

I used to use Parallels about 10 years ago, the UX was pretty awesome

James Nesta

02/09/2023, 11:22 PM

Looks like Parallels also supports M1.

Jason Bosco

02/09/2023, 11:22 PM

Good to know!

Jason Bosco

02/10/2023, 12:17 AM

Ok here you go. It was a python version issue. If you upgrade to python 3.9 it works

Copy code

sudo apt update && sudo apt install   build-essential   curl   libbz2-dev   libffi-dev   liblzma-dev   libncursesw5-dev   libreadline-dev   libsqlite3-dev   libssl-dev   libxml2-dev   libxmlsec1-dev   llvm   make   tk-dev   wget   xz-utils   zlib1g-dev   --yes
curl <https://pyenv.run> | bash
echo >> ~/.bashrc
echo '# Adding pyenv' >> ~/.bashrc
echo 'export PYENV_ROOT="$HOME/.pyenv"' >> ~/.bashrc
echo 'command -v pyenv >/dev/null || export PATH="$PYENV_ROOT/bin:$PATH"' >> ~/.bashrc
echo 'eval "$(pyenv init -)"' >> ~/.bashrc
source ~/.bashrc
pyenv install 3.9
pyenv local 3.9
pip install --upgrade pip
echo >> ~/.bashrc
echo '# Fixing pipx warning' >> ~/.bashrc
echo 'PATH=$PATH:~/.local/bin' >> ~/.bashrc
source ~/.bashrc
pip install --user pipenv
git clone <https://github.com/typesense/typesense-docsearch-scraper.git>
cd typesense-docsearch-scraper/
pipenv install
vim Pipfile # <==== Edit python version to 3.9 in Pipefile
pipenv install
pipenv shell

James Nesta

02/10/2023, 2:08 AM

It doesn't work for me:

Copy code

james@2wsx:~/typesense-docsearch-scraper$ pipenv install
Pipfile.lock (ba301c) out of date, updating to (402916)...
Locking [packages] dependencies...
Building requirements...
Resolving dependencies...
✘ Locking Failed!
⠹ Locking...
Traceback (most recent call last):
  File "/home/james/.local/lib/python3.9/site-packages/pipenv/resolver.py", line 845, in <module>
    main()
  File "/home/james/.local/lib/python3.9/site-packages/pipenv/resolver.py", line 819, in main
    _ensure_modules()
  File "/home/james/.local/lib/python3.9/site-packages/pipenv/resolver.py", line 16, in _ensure_modules
    spec.loader.exec_module(pipenv)
  File "<frozen importlib._bootstrap_external>", line 678, in exec_module
  File "<frozen importlib._bootstrap>", line 219, in _call_with_frames_removed
  File "/home/james/.local/lib/python3.9/site-packages/pipenv/__init__.py", line 63, in <module>
    from .cli import cli
  File "/home/james/.local/lib/python3.9/site-packages/pipenv/cli/__init__.py", line 1, in <module>
    from .command import cli  # noqa
  File "/home/james/.local/lib/python3.9/site-packages/pipenv/cli/command.py", line 4, in <module>
    from pipenv import environments
  File "/home/james/.local/lib/python3.9/site-packages/pipenv/environments.py", line 10, in <module>
    from pipenv.patched.pip._vendor.platformdirs import user_cache_dir
  File "/home/james/.local/lib/python3.9/site-packages/pipenv/patched/pip/_vendor/platformdirs/__init__.py", line 5
    from __future__ import annotations
    ^
SyntaxError: future feature annotations is not defined

James Nesta

02/10/2023, 2:13 AM

This is on a fresh Ubuntu 22, following these exact instructions.

Jason Bosco

02/10/2023, 2:30 AM

There’s a typo in my steps. I have a

pipenv install

before editing the python version in Pipfile

Jason Bosco

02/10/2023, 2:30 AM

Could you run

pipenv install

after editing python version?

James Nesta

02/10/2023, 2:30 AM

I've already edited the Python version in the Pipfile. I still get the error listed above.

Jason Bosco

02/10/2023, 2:31 AM

Let’s try

Copy code

pipenv --rm
pipenv --python 3.9
pipenv lock --clear
pipenv install

Jason Bosco

02/10/2023, 2:31 AM

Let me edit that, hang on

Jason Bosco

02/10/2023, 2:32 AM

ok done editing

James Nesta

02/10/2023, 2:36 AM

That worked. Can you edit my PR here to add the missing steps? https://github.com/typesense/typesense-docsearch-scraper/pull/23/files It's unclear from my side what those commands are doing.

🎉 1

Jason Bosco

02/10/2023, 2:36 AM

Sure will do

James Nesta

02/10/2023, 2:39 AM

Now, I'm getting a new error:

Copy code

james@2wsx:~/typesense-docsearch-scraper$ pipenv shell
Loading .env environment variables...
Loading .env environment variables...
Launching subshell in virtual environment...
 . /home/james/.local/share/virtualenvs/typesense-docsearch-scraper-fJqFon_Y/bin/activate
james@2wsx:~/typesense-docsearch-scraper$  . /home/james/.local/share/virtualenvs/typesense-docsearch-scraper-fJqFon_Y/bin/activate
(typesense-docsearch-scraper) james@2wsx:~/typesense-docsearch-scraper$
(typesense-docsearch-scraper) james@2wsx:~/typesense-docsearch-scraper$
(typesense-docsearch-scraper) james@2wsx:~/typesense-docsearch-scraper$
(typesense-docsearch-scraper) james@2wsx:~/typesense-docsearch-scraper$ ./docsearch docker:build
/home/james/typesense-docsearch-scraper/cli/src/commands/run_tests.py:22: SyntaxWarning: "is" with a literal. Did you mean "=="?
  if args[1] is "no_browser":
ERROR: permission denied while trying to connect to the Docker daemon socket at unix:///var/run/docker.sock: Get "<http://%2Fvar%2Frun%2Fdocker.sock/_ping>": dial unix /var/run/docker.sock: connect: permission denied
(typesense-docsearch-scraper) james@2wsx:~/typesense-docsearch-scraper$

Again, this is a fresh Ubuntu VM, and if it matters, I followed the normal steps to install Docker as according to the official Docker documentation here: https://docs.docker.com/engine/install/ubuntu/

Jason Bosco

02/10/2023, 2:40 AM

Docker needs to be run as root usually on Linux

Jason Bosco

02/10/2023, 2:40 AM

Also I’m not sure if the docker daemon autostarts after install

James Nesta

02/10/2023, 2:40 AM

Won't running as root mess up all of the careful Python-virtual-environment-related stuff that we have been carefully setting up over the past few hours?

James Nesta

02/10/2023, 2:41 AM

Copy code

(typesense-docsearch-scraper) james@2wsx:~/typesense-docsearch-scraper$ sudo ./docsearch docker:build
[sudo] password for james:
/usr/bin/env: 'python': No such file or directory

Jason Bosco

02/10/2023, 2:41 AM

I meant the docker daemon

Jason Bosco

02/10/2023, 2:41 AM

Could you check if it’s running

Jason Bosco

02/10/2023, 2:41 AM

Does

sudo docker run hello-world

work?

James Nesta

02/10/2023, 2:41 AM

Yes:

James Nesta

02/10/2023, 2:41 AM

Copy code

james@2wsx:~/typesense-docsearch-scraper$ sudo docker run hello-world
Unable to find image 'hello-world:latest' locally
latest: Pulling from library/hello-world
2db29710123e: Pull complete
Digest: sha256:aa0cc8055b82dc2509bed2e19b275c8f463506616377219d9642221ab53cf9fe
Status: Downloaded newer image for hello-world:latest

Hello from Docker!
This message shows that your installation appears to be working correctly.

To generate this message, Docker took the following steps:
 1. The Docker client contacted the Docker daemon.
 2. The Docker daemon pulled the "hello-world" image from the Docker Hub.
    (amd64)
 3. The Docker daemon created a new container from that image which runs the
    executable that produces the output you are currently reading.
 4. The Docker daemon streamed that output to the Docker client, which sent it
    to your terminal.

To try something more ambitious, you can run an Ubuntu container with:
 $ docker run -it ubuntu bash

Share images, automate workflows, and more with a free Docker ID:
 <https://hub.docker.com/>

For more examples and ideas, visit:
 <https://docs.docker.com/get-started/>

Jason Bosco

02/10/2023, 2:42 AM

This seems to be the issue: https://www.digitalocean.com/community/questions/how-to-fix-docker-got-permission-denied-while-trying-to-connect-to-the-docker-daemon-socket

Jason Bosco

02/10/2023, 2:43 AM

Hmm

Jason Bosco

02/10/2023, 2:43 AM

Although in your case the hello world container works

James Nesta

02/10/2023, 2:43 AM

Well I'm running the hello world container as sudo.

Jason Bosco

02/10/2023, 2:44 AM

Oh right

James Nesta

02/10/2023, 2:44 AM

I will try following this guide.

👍 1

James Nesta

02/10/2023, 2:52 AM

Ok, that worked. I updated the pull request again.

James Nesta

02/10/2023, 2:53 AM

Now I'm getting a new error after running `./docsearch docker:build`:

Copy code

=> ERROR [13/26] RUN apt-get update -y && apt-get install -yq   google-chrome-stable=99.0.4844.51-1   unzip                                                            2.5s
------
 > [13/26] RUN apt-get update -y && apt-get install -yq   google-chrome-stable=99.0.4844.51-1   unzip:
#0 0.362 Get:1 <http://dl.google.com/linux/chrome/deb> stable InRelease [1811 B]
#0 0.444 Hit:2 <https://deb.nodesource.com/node_8.x> bionic InRelease
#0 0.450 Hit:3 <http://security.ubuntu.com/ubuntu> bionic-security InRelease
#0 0.462 Hit:4 <http://archive.ubuntu.com/ubuntu> bionic InRelease
#0 0.481 Get:5 <http://dl.google.com/linux/chrome/deb> stable/main amd64 Packages [1061 B]
#0 0.552 Hit:6 <http://archive.ubuntu.com/ubuntu> bionic-updates InRelease
#0 0.565 Hit:7 <http://ppa.launchpad.net/openjdk-r/ppa/ubuntu> bionic InRelease
#0 0.639 Hit:8 <http://archive.ubuntu.com/ubuntu> bionic-backports InRelease
#0 0.732 Fetched 2872 B in 0s (6409 B/s)
#0 0.732 Reading package lists...
#0 1.713 Reading package lists...
#0 2.353 Building dependency tree...
#0 2.482 Reading state information...
#0 2.495 E: Version '99.0.4844.51-1' for 'google-chrome-stable' was not found
------
Dockerfile.base:37
--------------------
  36 |     RUN echo "deb [arch=amd64]  <http://dl.google.com/linux/chrome/deb/> stable main" >> /etc/apt/sources.list.d/google-chrome.list
  37 | >>> RUN apt-get update -y && apt-get install -yq \
  38 | >>>   google-chrome-stable=99.0.4844.51-1 \
  39 | >>>   unzip
  40 |     RUN wget -q <https://chromedriver.storage.googleapis.com/99.0.4844.51/chromedriver_linux64.zip>
--------------------
ERROR: failed to solve: process "/bin/sh -c apt-get update -y && apt-get install -yq   google-chrome-stable=99.0.4844.51-1   unzip" did not complete successfully: exit code: 100
(typesense-docsearch-scraper) james@2wsx:~/typesense-docsearch-scraper$

Jason Bosco

02/10/2023, 5:52 AM

I just remembered another PR that fell off my radar… I think this actually will fix a lot of these issues: https://github.com/typesense/typesense-docsearch-scraper/pull/16/files

Jason Bosco

02/10/2023, 5:52 AM

Could you check out that branch and try running build from there?

Jason Bosco

02/10/2023, 5:52 AM

If that works, I can merge that PR in

James Nesta

02/10/2023, 6:13 AM

I tried it, and I get the same error relating to Google Chrome.

Jason Bosco

02/10/2023, 5:43 PM

Could you try changing it to the latest version of Chrome?

James Nesta

02/10/2023, 7:26 PM

That worked. Do you want me to add that to my PR?

Jason Bosco

02/10/2023, 7:27 PM

Yeah, that would be great

James Nesta

02/10/2023, 7:38 PM

Ok, when running the crawler, I get a new error:

Copy code

2023-02-10 19:38:18 [scrapy.core.scraper] ERROR: Spider error processing <GET <https://isaacscript.github.io/>> (referer: None)
Traceback (most recent call last):
  File "/home/seleuser/.local/share/virtualenvs/seleuser-AdYDHarm/lib/python3.10/site-packages/twisted/internet/defer.py", line 892, in _runCallbacks
    current.result = callback(  # type: ignore[misc]
  File "/home/seleuser/src/documentation_spider.py", line 180, in parse_from_start_url
    self.add_records(response, from_sitemap=False)
  File "/home/seleuser/src/documentation_spider.py", line 152, in add_records
    self.typesense_helper.add_records(records, response.url, from_sitemap)
  File "/home/seleuser/src/typesense_helper.py", line 65, in add_records
    failed_items = list(
  File "/home/seleuser/src/typesense_helper.py", line 67, in <lambda>
    filter(lambda r: json.loads(json.loads(r))['success'] is False, result)))
  File "/usr/lib/python3.10/json/__init__.py", line 339, in loads
    raise TypeError(f'the JSON object must be str, bytes or bytearray, '
TypeError: the JSON object must be str, bytes or bytearray, not dict
2023-02-10 19:38:19 [scrapy.core.scraper] ERROR: Spider error processing <GET <https://isaacscript.github.io/isaac-typescript-definitions/enums/CollectibleSpriteLayer/>> (referer: <https://isaacscript.github.io/sitemap.xml>)

Jason Bosco

02/10/2023, 8:02 PM

Could you share the changes you made to that file?

James Nesta

02/10/2023, 8:02 PM

To what file?

Jason Bosco

02/10/2023, 8:02 PM

src/typesense_helper.py

Jason Bosco

02/10/2023, 8:02 PM

Or you didn’t make any changes?

James Nesta

02/10/2023, 8:03 PM

Well, you told me to add this:

Copy code

'symbols_to_index': '_',

I did that, and then I got an error complaining that "symbols_to_index" needed to be an array, so I assume that you just made a typo. Then, I updated it to be this:

James Nesta

02/10/2023, 8:04 PM

Copy code

'symbols_to_index': ['_'],

James Nesta

02/10/2023, 8:04 PM

That got me past the error, and it started crawling my website, but then I got the error that I pasted above.

Jason Bosco

02/10/2023, 8:28 PM

Could you add

print(result)

right before this line and share that ouput?

James Nesta

02/10/2023, 8:49 PM

Copy code

[{'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}]

Jason Bosco

02/10/2023, 8:49 PM

Is that just before it errors out?

James Nesta

02/10/2023, 8:49 PM

Well, the first one is

Copy code

[{'success': True}]

James Nesta

02/10/2023, 8:50 PM

But it generates the longer one after each error.

James Nesta

02/10/2023, 8:50 PM

e.g.

James Nesta

02/10/2023, 8:50 PM

Copy code

james@2wsx:~/crawler$ ./run.sh
[{'success': True}]
2023-02-10 20:48:58 [scrapy.core.scraper] ERROR: Spider error processing <GET <https://isaacscript.github.io/>> (referer: None)
Traceback (most recent call last):
  File "/home/seleuser/.local/share/virtualenvs/seleuser-AdYDHarm/lib/python3.10/site-packages/twisted/internet/defer.py", line 892, in _runCallbacks
    current.result = callback(  # type: ignore[misc]
  File "/home/seleuser/src/documentation_spider.py", line 180, in parse_from_start_url
    self.add_records(response, from_sitemap=False)
  File "/home/seleuser/src/documentation_spider.py", line 152, in add_records
    self.typesense_helper.add_records(records, response.url, from_sitemap)
  File "/home/seleuser/src/typesense_helper.py", line 66, in add_records
    failed_items = list(
  File "/home/seleuser/src/typesense_helper.py", line 68, in <lambda>
    filter(lambda r: json.loads(json.loads(r))['success'] is False, result)))
  File "/usr/lib/python3.10/json/__init__.py", line 339, in loads
    raise TypeError(f'the JSON object must be str, bytes or bytearray, '
TypeError: the JSON object must be str, bytes or bytearray, not dict
[{'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}]
2023-02-10 20:48:58 [scrapy.core.scraper] ERROR: Spider error processing <GET <https://isaacscript.github.io/isaac-typescript-definitions/enums/ConstantStoneShooterVariant/>> (referer: <https://isaacscript.github.io/sitemap.xml>)
Traceback (most recent call last):
  File "/home/seleuser/.local/share/virtualenvs/seleuser-AdYDHarm/lib/python3.10/site-packages/twisted/internet/defer.py", line 892, in _runCallbacks
    current.result = callback(  # type: ignore[misc]
  File "/home/seleuser/src/documentation_spider.py", line 172, in parse_from_sitemap
    self.add_records(response, from_sitemap=True)
  File "/home/seleuser/src/documentation_spider.py", line 152, in add_records
    self.typesense_helper.add_records(records, response.url, from_sitemap)
  File "/home/seleuser/src/typesense_helper.py", line 66, in add_records
    failed_items = list(
  File "/home/seleuser/src/typesense_helper.py", line 68, in <lambda>
    filter(lambda r: json.loads(json.loads(r))['success'] is False, result)))
  File "/usr/lib/python3.10/json/__init__.py", line 339, in loads
    raise TypeError(f'the JSON object must be str, bytes or bytearray, '
TypeError: the JSON object must be str, bytes or bytearray, not dict
[{'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}]
2023-02-10 20:48:59 [scrapy.core.scraper] ERROR: Spider error processing <GET <https://isaacscript.github.io/isaac-typescript-definitions/enums/CollectibleAnimation/>> (referer: <https://isaacscript.github.io/sitemap.xml>)
Traceback (most recent call last):
  File "/home/seleuser/.local/share/virtualenvs/seleuser-AdYDHarm/lib/python3.10/site-packages/twisted/internet/defer.py", line 892, in _runCallbacks
    current.result = callback(  # type: ignore[misc]
  File "/home/seleuser/src/documentation_spider.py", line 172, in parse_from_sitemap
    self.add_records(response, from_sitemap=True)
  File "/home/seleuser/src/documentation_spider.py", line 152, in add_records
    self.typesense_helper.add_records(records, response.url, from_sitemap)
  File "/home/seleuser/src/typesense_helper.py", line 66, in add_records
    failed_items = list(
  File "/home/seleuser/src/typesense_helper.py", line 68, in <lambda>
    filter(lambda r: json.loads(json.loads(r))['success'] is False, result)))
  File "/usr/lib/python3.10/json/__init__.py", line 339, in loads
    raise TypeError(f'the JSON object must be str, bytes or bytearray, '
TypeError: the JSON object must be str, bytes or bytearray, not dict
^C[{'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}]
2023-02-10 20:49:00 [scrapy.core.scraper] ERROR: Spider error processing <GET <https://isaacscript.github.io/isaac-typescript-definitions/enums/CollectibleType/>> (referer: <https://isaacscript.github.io/sitemap.xml>)
Traceback (most recent call last):
  File "/home/seleuser/.local/share/virtualenvs/seleuser-AdYDHarm/lib/python3.10/site-packages/twisted/internet/defer.py", line 892, in _runCallbacks
    current.result = callback(  # type: ignore[misc]
  File "/home/seleuser/src/documentation_spider.py", line 172, in parse_from_sitemap
    self.add_records(response, from_sitemap=True)
  File "/home/seleuser/src/documentation_spider.py", line 152, in add_records
    self.typesense_helper.add_records(records, response.url, from_sitemap)
  File "/home/seleuser/src/typesense_helper.py", line 66, in add_records
    failed_items = list(
  File "/home/seleuser/src/typesense_helper.py", line 68, in <lambda>
    filter(lambda r: json.loads(json.loads(r))['success'] is False, result)))
  File "/usr/lib/python3.10/json/__init__.py", line 339, in loads
    raise TypeError(f'the JSON object must be str, bytes or bytearray, '
TypeError: the JSON object must be str, bytes or bytearray, not dict
^C[{'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}]
2023-02-10 20:49:00 [scrapy.core.scraper] ERROR: Spider error processing <GET <https://isaacscript.github.io/isaac-typescript-definitions/enums/ConstantStoneShooterSubType/>> (referer: <https://isaacscript.github.io/sitemap.xml>)
Traceback (most recent call last):
  File "/home/seleuser/.local/share/virtualenvs/seleuser-AdYDHarm/lib/python3.10/site-packages/twisted/internet/defer.py", line 892, in _runCallbacks
    current.result = callback(  # type: ignore[misc]
  File "/home/seleuser/src/documentation_spider.py", line 172, in parse_from_sitemap
    self.add_records(response, from_sitemap=True)
  File "/home/seleuser/src/documentation_spider.py", line 152, in add_records
    self.typesense_helper.add_records(records, response.url, from_sitemap)
  File "/home/seleuser/src/typesense_helper.py", line 66, in add_records
    failed_items = list(
  File "/home/seleuser/src/typesense_helper.py", line 68, in <lambda>
    filter(lambda r: json.loads(json.loads(r))['success'] is False, result)))
  File "/usr/lib/python3.10/json/__init__.py", line 339, in loads
    raise TypeError(f'the JSON object must be str, bytes or bytearray, '
TypeError: the JSON object must be str, bytes or bytearray, not dict
^C^C[{'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}, {'success': True}]
2023-02-10 20:49:01 [scrapy.core.scraper] ERROR: Spider error processing <GET <https://isaacscript.github.io/isaac-typescript-definitions/enums/CollectibleSpriteLayer/>> (referer: <https://isaacscript.github.io/sitemap.xml>)

Jason Bosco

02/10/2023, 9:30 PM

Looks like the typesense-python version in the lockfile is a previous version, which has slightly different behavior which is what is causing this issue.

Jason Bosco

02/10/2023, 9:30 PM

Could you lock the typesense package to this version in

Pipfile

Copy code

typesense = "==0.10.0"

Jason Bosco

02/10/2023, 9:30 PM

Then run

pipenv install

and run the scraper again

James Nesta

02/10/2023, 9:34 PM

It seems to be scraping now without any errors, thanks. Do you want me to add that to the PR?

🎉 1

Jason Bosco

02/10/2023, 9:38 PM

Yeah, that would be great

James Nesta

02/10/2023, 9:40 PM

Merge this now maybe? https://github.com/typesense/typesense-docsearch-scraper/pull/16

Jason Bosco

02/10/2023, 9:41 PM

Is that what you’re using now to run the scraper?

James Nesta

02/10/2023, 9:42 PM

Indeed, as it was the only way to build it, as we discussed yesterday.

Jason Bosco

02/10/2023, 9:42 PM

Got it, in that case, I’ll make the change to typesense version in pipefile in that PR

James Nesta

02/10/2023, 9:42 PM

Also I noticed that you might want to close this one, as the author seems MIA: https://github.com/typesense/typesense-docsearch-scraper/pull/3

James Nesta

02/10/2023, 9:43 PM

When the crawler finishes, is it supposed to return me to my shell? It seems to be hanging.

Jason Bosco

02/10/2023, 9:43 PM

It should yeah

Jason Bosco

02/10/2023, 9:43 PM

What are the last say 20 lines?

James Nesta

02/10/2023, 9:43 PM

Copy code

DEBUG:typesense.api_call:Making post /collections/isaacscript_1676064791/documents/import
DEBUG:typesense.api_call:Try 1 to node <http://isaacracing.net:8108|isaacracing.net:8108> -- healthy? True
DEBUG:urllib3.connectionpool:Starting new HTTPS connection (1): <http://isaacracing.net:8108|isaacracing.net:8108>
DEBUG:urllib3.connectionpool:<https://isaacracing.net:8108> "POST /collections/isaacscript_1676064791/documents/import HTTP/1.1" 200 None
DEBUG:typesense.api_call:<http://isaacracing.net:8108|isaacracing.net:8108> is healthy. Status code: 200
['"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"']
DEBUG:typesense.api_call:Making post /collections/isaacscript_1676064791/documents/import
DEBUG:typesense.api_call:Try 1 to node <http://isaacracing.net:8108|isaacracing.net:8108> -- healthy? True
DEBUG:urllib3.connectionpool:Starting new HTTPS connection (1): <http://isaacracing.net:8108|isaacracing.net:8108>
DEBUG:urllib3.connectionpool:<https://isaacracing.net:8108> "POST /collections/isaacscript_1676064791/documents/import HTTP/1.1" 200 None
DEBUG:typesense.api_call:<http://isaacracing.net:8108|isaacracing.net:8108> is healthy. Status code: 200
['"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"', '"{\\"success\\":true}"']

James Nesta

02/10/2023, 9:44 PM

If I had to guess, it just finished going through all the URLs, but maybe its choking on some final processing, or something.

Jason Bosco

02/10/2023, 9:45 PM

Hmm, I’ve never it hang like this… May be try restarting?

Jason Bosco

02/10/2023, 9:47 PM

Btw, you had to make a chrome version upgrade right? Since you’re anyway updating code, could you also now add typesese pinning to pipfile

James Nesta

02/10/2023, 9:48 PM

I already did.

Jason Bosco

02/10/2023, 9:48 PM

I’ve merged in the PR that you’re using

Jason Bosco

02/10/2023, 9:48 PM

After merging that, there’s now a conflict in your PR. could you resolve that?

James Nesta

02/10/2023, 9:51 PM

Just did.

👍 1

James Nesta

02/10/2023, 9:59 PM

Ok, I ran it again, and the second time it completed successfully.

James Nesta

02/10/2023, 9:59 PM

However, when I search for "TEAR_FALLING_ACCELERATION" on my website, it still shows up as 0 results, so it looks like the modification didn't accomplish anything.

Jason Bosco

02/10/2023, 10:20 PM

Hmm, ok let’s try one more thing. Instead of

symbols_to_index: ['_']

, could you change that to

token_separators: ['_']

and then rerun the scraper?

James Nesta

02/10/2023, 10:21 PM

Actually, I realized something. As a hack, I tried running Prettier on the Markdown, which seems to remove the escape characters automatically, so I'll add that to my build pipeline, and see if it fixes the underlying problem.

James Nesta

02/10/2023, 10:33 PM

Yeah, it looks like using Prettier fixes things:

James Nesta

02/10/2023, 10:33 PM

message has been deleted

Jason Bosco

02/10/2023, 10:41 PM

Yaaay!

James Nesta

02/10/2023, 11:23 PM

Some of my webpages are not showing up in the search. For example, this page: https://isaacscript.github.io/isaacscript-common/other/enums/ModCallbackCustom I suspect that it is because there are too many elements on the page. In Algolia land, there is a really nice GUI that tells you the specific pages that had 404s or otherwise had errors. Is there a way to get that kind of functionality from the Typesense crawler? Are there any blogs you can point me towards that explain how to start troubleshooting this kind of thing?

Jason Bosco

02/10/2023, 11:24 PM

You want to look the scraper logs… Looks like you already have DEBUG logs turned on based on what you shared earlier

Jason Bosco

02/10/2023, 11:25 PM

Search for

Too much hits, DocSearch only handle

Jason Bosco

02/10/2023, 11:25 PM

If you search for that in the scraper codebase, you can change that value

James Nesta

02/10/2023, 11:25 PM

I did save the output, but I don't get any results for "error", and all the entries relating to "ModCallbackCustom" look to be normal messages indicating that the page was properly ingested. I don't have any results for "Too much hits". Anything more specific that I should be looking for?

James Nesta

02/10/2023, 11:26 PM

I can throw the output on pastebin if that is helpful.

Jason Bosco

02/10/2023, 11:26 PM

Hmm then may be the issue is something else

Jason Bosco

02/10/2023, 11:27 PM

If the issue is about too many elements on the page that’s the error you’ll see, per the code: https://github.com/typesense/typesense-docsearch-scraper/blob/37334bbcea17df8eedeeb82200815a4fe8e02759/scraper/src/documentation_spider.py#L154

Jason Bosco

02/10/2023, 11:28 PM

Ah, found the issue

Jason Bosco

02/10/2023, 11:29 PM

Same as this: https://typesense-community.slack.com/archives/C01P749MET0/p1675987110082129?thread_ts=1675979239.202479&cid=C01P749MET0

Jason Bosco

02/10/2023, 11:29 PM

In your scraper config ^

James Nesta

02/10/2023, 11:33 PM

That worked, great.

James Nesta

02/10/2023, 11:33 PM

Should this be documented somewhere? I feel like the TypeSense docs should tell you to do that.

Jason Bosco

02/10/2023, 11:34 PM

It’s very docusaurus specific… Let me see if there’s a better docusaurus config that we can just link to

Jason Bosco

02/10/2023, 11:35 PM

Looks like Docusaurus’s doc site has since moved to Algolia’s proprietary crawler for their search

Jason Bosco

02/10/2023, 11:36 PM

So yeah would be good to call this out as one of the bullet points you added

James Nesta

02/10/2023, 11:37 PM

Do you want me to do another PR?

Jason Bosco

02/10/2023, 11:37 PM

Yeah that’s on a different repo

James Nesta

02/10/2023, 11:37 PM

Aye, I already did a PR to typesense-website yesterday.

Jason Bosco

02/10/2023, 11:38 PM

I already merged that in

James Nesta

02/10/2023, 11:38 PM

Right, I was asking if you wanted me to do another PR to

typesense-website

, or if you wanted to take care of it.

Jason Bosco

02/10/2023, 11:38 PM

Ah, it would be great if you can do another PR

James Nesta

02/10/2023, 11:45 PM

Ok, I'm starting on it now.

James Nesta

02/10/2023, 11:45 PM

I have a question about pricing. If I pay for a cloud-hosted Typesense by you guys, would you also automatically index it in the cloud as well?

Jason Bosco

02/10/2023, 11:46 PM

No, we only host the Typesense Cloud cluster. The scraper is something you’d host on your side (typically in your CI pipeline, you’d trigger the scraper to run post deploy of your docs site)

James Nesta

02/10/2023, 11:46 PM

Oh, that makes perfect sense, thank you.

👍 1

James Nesta

02/10/2023, 11:46 PM

I'll add that to the PR as well.

Jason Bosco

02/10/2023, 11:47 PM

I think there’s a tip section in there already that might mention this… about triggering it from your CI pipeline

James Nesta

02/11/2023, 12:54 AM

https://github.com/typesense/typesense-website/pull/162/files

👍 1

Open in Slack

Previous Next