Architecture Overview

ARCHIVE is deployed as a multi-node cluster. Each node runs a subset of services, and the assignment of services to nodes can vary depending on the load on each machine. In its most basic configuration, ARCHIVE can function with a single Archive node that acts as Master and hosts the Search service, plus a single Index Server node. Worker nodes are optional. Master, Worker, and Search nodes are deployed on Windows Server 2016 virtual machines. Index Server nodes are Linux virtual machines based on CentOS 7.5.
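The node roles and the minimum configuration described above can be sketched as a small data model. This is only an illustration, not part of ARCHIVE itself: the names `Role`, `Node`, and `validate_cluster`, as well as the host names, are hypothetical, and the check encodes just the minimum stated in the text (a Master, a hosted Search service, and an Index Server).

```python
from dataclasses import dataclass
from enum import Enum
from typing import List, Set


class Role(Enum):
    MASTER = "master"
    WORKER = "worker"
    SEARCH = "search"
    INDEX_SERVER = "index_server"


# Operating system family per role, as described in the text.
ROLE_OS = {
    Role.MASTER: "Windows",
    Role.WORKER: "Windows",
    Role.SEARCH: "Windows",
    Role.INDEX_SERVER: "Linux",
}


@dataclass
class Node:
    name: str          # hypothetical host name
    roles: Set[Role]   # services hosted on this node

    def operating_system(self) -> str:
        """Return the OS family implied by the node's roles."""
        systems = {ROLE_OS[role] for role in self.roles}
        if len(systems) != 1:
            raise ValueError(f"{self.name}: roles must map to exactly one OS")
        return systems.pop()


def validate_cluster(nodes: List[Node]) -> List[str]:
    """Return a list of problems; an empty list means the minimum is met."""
    hosted = {role for node in nodes for role in node.roles}
    problems = []
    if Role.MASTER not in hosted:
        problems.append("no Master node")
    if Role.SEARCH not in hosted:
        problems.append("Search service is not hosted on any node")
    if Role.INDEX_SERVER not in hosted:
        problems.append("no Index Server node")
    return problems


# Most basic configuration: the Master hosts Search, plus one Index Server.
minimal = [
    Node("archive-01", {Role.MASTER, Role.SEARCH}),
    Node("index-01", {Role.INDEX_SERVER}),
]
print(validate_cluster(minimal))
```

Worker nodes are deliberately absent from the check because the text describes them as optional.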

ARCHIVE typically runs more than two nodes. The size of the cluster depends on a variety of factors:

Number of users
Volume of archived data to be searched
Volume of live email to be audited
Frequency with which the data is accessed
Frequency and scope of eDiscovery requests
Configuration of the mail environment
File indexing and cloud server requirements

ARCHIVE connects to different cloud and on-premise data sources to collect data. The results of these collections can be:

Archived in attached data stores
Indexed to allow for search and eDiscovery on the content using IPRO Search
Both archived and indexed
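The three possible outcomes of a collection can be modeled as a pair of combinable flags. The sketch below is a hypothetical illustration using Python's standard `enum.Flag`. `CollectionOutcome` and `describe` are invented names, not ARCHIVE settings.

```python
from enum import Flag, auto
from typing import List


class CollectionOutcome(Flag):
    ARCHIVE = auto()          # store in an attached data store
    INDEX = auto()            # index for search and eDiscovery
    BOTH = ARCHIVE | INDEX    # archive and index


def describe(outcome: CollectionOutcome) -> List[str]:
    """List the processing steps implied by an outcome."""
    steps = []
    if CollectionOutcome.ARCHIVE in outcome:
        steps.append("archive in attached data store")
    if CollectionOutcome.INDEX in outcome:
        steps.append("index for search and eDiscovery")
    return steps


print(describe(CollectionOutcome.BOTH))
```

Using flags rather than three unrelated constants keeps "both" as a literal combination of the other two, so code that tests for archiving or indexing does not need a special case.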

The current out-of-the-box data sources supported are:

On-Premise

Exchange
Windows File System
SharePoint
GroupWise
Outlook PSTs

Cloud

Exchange O365
SharePoint
OneDrive
Box for Business
ShareFile
Egnyte

ARCHIVE accesses data through connectors, one per data source, an approach that makes it straightforward to add support for additional data sources.
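As a rough illustration of why a connector approach makes new sources easy to add, the sketch below shows a common pattern: a shared base class plus a registry keyed by source name. None of this reflects ARCHIVE's actual connector interface, which the text does not describe. `Connector`, `register`, `connector_for`, and `FileSystemConnector` are hypothetical names, and the example connector reads from an in-memory list instead of a real file system.

```python
from abc import ABC, abstractmethod
from typing import Dict, Iterable, Iterator, Type


class Connector(ABC):
    """Hypothetical base class: one subclass per data source."""

    source_name: str = ""

    @abstractmethod
    def collect(self) -> Iterator[dict]:
        """Yield collected items as plain dictionaries."""


_REGISTRY: Dict[str, Type[Connector]] = {}


def register(cls: Type[Connector]) -> Type[Connector]:
    """Class decorator that makes a connector discoverable by source name."""
    _REGISTRY[cls.source_name] = cls
    return cls


@register
class FileSystemConnector(Connector):
    source_name = "Windows File System"

    def __init__(self, items: Iterable[str]):
        # Stand-in for a real directory walk: items are supplied in memory.
        self._items = list(items)

    def collect(self) -> Iterator[dict]:
        for item in self._items:
            yield {"source": self.source_name, "item": item}


def connector_for(source_name: str, *args, **kwargs) -> Connector:
    """Instantiate the connector registered for a data source."""
    try:
        cls = _REGISTRY[source_name]
    except KeyError:
        raise LookupError(f"no connector registered for {source_name!r}") from None
    return cls(*args, **kwargs)


connector = connector_for("Windows File System", ["report.docx"])
print(list(connector.collect()))
```

Under this pattern, supporting another source means writing one more registered subclass. The code that requests a connector by name stays unchanged.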

The following diagram represents a typical cluster consisting of a Master Archive server, two Worker nodes, and two Index Server nodes.