-
Notifications
You must be signed in to change notification settings - Fork 1.1k
[mdanalysis] no new search results after adding new sitemaps to sitemapindex #4699
Comments
* new text selectors for definitions (dt tags) in dl (definition lists) as used in the sphinx-generated API docs (to index the function/class names themselves); dd (data tags) do not need a selector because text content is wrapped in p tags inside the dd and is already selected * new text selectors for pre (code blocks) * see issue algolia#4699
|
Users will receive access to the new infrastructure on a random basis. Sorry, we are a bit early in the process right now to do bigger batches/deploy certain configs! You can read more here: https://docsearch.algolia.com/docs/migrating-from-legacy#migration-seems-to-have-started-but-i-dont-have-received-any-emails
We don't grant access to the dashboard but only the Analytics, sorry! (But it will be available in the new infra :D) Missing pages
I'd suggest you to adapt the Below, the URLs matching We can see that some selectors don't match certain pages. To make it more specific, you can use No indexThere's was a typo in your config (sorry I didn't saw it), I've fixed it in: #4712 |
|
I installed the docsearch-scraper locally and I’m able to run it so I can now debug more easily. |
Well... maybe not that simple:
Will need to see how to work within these limitations. |
I am now using a scraper with disabled index submission for testing, see orbeckst/docsearch-scraper#1. |
That's a good idea! It would be nice to see it as an option indeed Let me know if I can help you debug your issue |
Many of the missing terms are due to broken sitemaps. Apparently, somewhere our Sphinx + GitHub actions based doc deployment changed and the sitemaps now contain a version information that is not actually present in the deployment URL. That's a problem on our end. I'll leave this issue open for the moment. |
PR #4751 addresses some of the problems and we also fixed sitemaps. The PR has some more comments on what still seems to be missing, including the output from the scraper (for 0 record pages). Any insights why we're still missing content would be appreciated. Thanks! |
As mentioned in PR #4751 there are still a number of "0 records" pages, namely under
From #4751 (comment) :
Debugging the selectors is the next step... |
Bug: no search results after adding new sitemaps
If it is a DocSearch index issue, what is the related
index_name
?index_name
= mdanalysisWhat is the current behaviour?
We added two new sitemaps to our siteindex (see Any other feedback below for details) for "distopia" and "pytng". Searching for new unique content does not give any results:
If the current behaviour is a bug, please provide all the steps to reproduce and screenshots with context.
To perform search, go to https://www.mdanalysis.org/ and use the search box as show in the screen shot:
The screen shot shows that the unique term CalcBondsOrtho (for distopia) is not found, as explained in more detail below:
distopia content is not found
distopia failed example text
distopia failed example API docs
Note that this example probably fails because content is in a dl (definition list) with dt/dd tags:
pytng content is not found
pytng failed example text
pytng failed example API docs
Note that this example probably fails to find the API doc because content is in a dl (definition list) with dt/dd tags, the text use should have been found
What is the expected behaviour?
Relevant pages from the distopia and pytng docs are found, as indicated above. (It was clearer to include the expected results above for the individual examples).
What have you tried to solve it?
Any quick clues?
Some content (for example, the API docs in https://www.mdanalysis.org/distopia/api/distopia.html) are stored in definition lists (dl with dt/dd elements) and the dt tags are NOT included as selectors in the algolia config file yet. (see PR #4700)
However, no idea why standard text is not appearing; seeing the scraper output might help but that requires algolia staff help.
Any other feedback / questions ?
We added two new sitemaps to our sitemap index https://www.mdanalysis.org/sitemapindex.xml for
Our open issues:
The text was updated successfully, but these errors were encountered: