Monday, 22 July 2013

SharePoint 2013 Search Overview

Overview:  This post explains how SharePoint 2013 Search works.  SharePoint 2013 search is the latest search within the SP product and replaces SP and Fast search that was used with SP2010.

6 Components:
  1. Query
  2. Index
  3. Analytics
  4. Content Processing
  5. Crawl
  6. Admin
Tip:  Group Query & Index roles on the same server.  Then group Analytics & Content processing.  Group Crawl & Admin (you can also add content processing to thses servers).

Installing search on SP2013 creates 4 database (all db's by default use the 'Simple' recovery model):
  1. Search Admin,
  2. Search analytics Reporting,
  3. Search Crawl, and
  4. Search Links.
A guideline from Microsoft is: "Add one index partition for every 10 million items in the search index."  This depends on how you are using search, so if you have more documents per index but have few queries or can live with longer response times or your kit can deal more queries this can be increase or decreased.
Index partions are splitting the data vertically, so if you have 25 million search items and want less than 10 million per partition, you will need 3 index partions on the 3 index servers.  You do not have redunancy, so if any index goes down your search is broken.  Inder replicas as the name suggests is a copy.  I think of this as horizontal scaling.  So if you want HA on your 25 million item search farm, you need another 3 index (Replica) partitions.  You have 6 index servers.  Using index replicas will improve query results speed.

Search Architectures for SharePoint 2013 - From MS (Kavindra Palaraja out of his oit2013-model-sharepoint-search-architecture.pdf document).  This is not my diagram but it explains the components nicely.
Search Components in SharePoint 2013
Replicas and Index partions explained are refered to as Rows and Columns. 
Add a new Index partion when the number of documents in the index exceeds 30 million.

SharePoint 2013 supports 3 types of Crawls:
  1. Full (SP2010) 
  2. Incremental (SP2010)and
  3. Continuous.
  • The continuous crawl on works on SP2013 content and shall display content in the crawl results as soon as it is indexed ( it doesn't wait for the crawl to complete).
  • It is a good idea to run incremental crawls as they index more data than just SP2013 data and continuous crawl does not process or retry items that return errors and the incremental crawl shall clean theses items up.

More Info:
Capacity management and sizing overview for SharePoint Server 2013
SP2013 Stretch Farms
SP2013 Database types and desc

Design Goal - Index partitions and Index replicas.

Partitions marked in Red.

Note: 2016/11/16 - Adding email messages e.g. msg to SharePoint has always crawled the data however in MOSS and SP2010, the attachments do not get crawled.  SP2013 (it may be since SP1) and Office 365 will also inde the attachments of messages saved in SharePoint.


Post a Comment