Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Why would certain Sitecore content not be indexed by Solr? [closed]

I have a site built using Sitecore 7.5 and Solr 4.7. I am noticing a very strange phenomenon where certain content items are not getting indexed. Here is what is happening.

I have a data template called "Content Page". Most of the content items that are using this data template live in /sitecore/content/Home/Corporate. Things like this:

/sitecore/content/Home/Corporate/About-Us /sitecore/content/Home/Corporate/Careers /sitecore/content/Home/Corporate/Terms-Of-Use 

There are a few test items that I created that live in /sitecore/content like this:

/sitecore/content/Test-1 /sitecore/content/Test-2 

All of these pages are built using the "Content Page" data template. After I rebuild the index and then go to the Solr UI and do a query I can't find any of the items that live beneath the Home node. When I do a query for _templatename:"Content Page" - I only get Test-1 and Test-2 in the results. Does anyone know why this would be happening? It's really odd. I've never seen something where Solr just doesn't index certain items at all. And it seems to be based on where they are in the content tree.

like image 827
Corey Burnett Avatar asked Jun 30 '15 19:06

Corey Burnett


People also ask

How do I know if Solr is indexing?

In the left column, select 'liferay' under the Core Selector drop-down, and navigate to the query page. Make a query of any content that should be displayed in the Liferay UI. If the appropriate index results are outputted on the right, this confirms that the content is indexed.

How does SOLR work with Sitecore?

Populate an XML schema for SolrThis tool automatically populates Sitecore fields and makes sure all fields that Sitecore needs are present. You can add more fields to this schema by adding fields to the managed-schema file as long as you do not change the system index fields.

What is Sitecore indexing?

Sitecore Experience Platform (XP) has three search frameworks: Content Search, which is used by various components primarily to search and index Sitecore items. xConnect Search, which is used to search and index experience data. Commerce Search, which is used to search and index customers and orders.

How long is SOLR indexing?

Full index takes about 40 hours using DB. There are some factors that might slowing you down: Memory.


1 Answers

According to Pavel Veller on the Sitecore forums, answering a question similar to yours:

First, there are different fallback strategies. Field-level is most likely done via the StandardValuesProvider and yes, you need to have a version in the language for the fallback to kick in. We built a Language Tool wizard in SCORE that can create empty language versions to help in exactly this scenario. You can also do item-level fallback with ItemProvider which will make it appear as if an item had a version in the language even if it doesn't. The good thing about both is that you fallback on the Sitecore API level. Services like search crawlers and publishing all work via those APIs (until you wrote your own, of course) and as a result they all "see" the fallback. Publishing will actually "materialize" the fallback value and the crawler on CD will work with the items and field values as if those were real.

like image 136
Joshua Adriaanse Avatar answered Sep 24 '22 23:09

Joshua Adriaanse