The following are Frequently Asked Questions (FAQs) in Enterprise Archive application:

Search

What is the difference between a Network and a Communication Channel?

A Network refers to the source of archived communication. For example, MS Teams, Slack, Journal, Zoom, and so on. The Network filter in the Search page allows you to search for communications archived from a specific networks. By default, Enterprise Archive searches for communications from all the networks recorded.

A Communication Channel refers to the mode of communication within a network. For example, emails, collaboration, chat, voice, videos, blogs, and so on, which can be from one or more networks. The Communication Channels filter in the Search page allows you to s earch for communications archived from specific communication channels or modality of communication. By default, Enterprise Archive searches for communications from all the channels recorded.

Can phrases be searched using List of Terms section?

Yes, phrases can be searched using List of Terms section.

What is the difference while performing a phrase search using single quote, double quotes and without quotes under List of Terms section?

Phrase searches can be performed without using double quotes.

It is advisable not to use single or double quotes when performing phrase searches in the List of Terms section.

When a phrase is expected to match results in exactly the same order, please use them in a single continuous line without using single or double-quotes.

Note

While the single quotes do not change the result set, double quotes end up breaking the phrase in individual words and the results contain all documents having at least one of these words, thus returning a much larger set of results.

Are special characters indexed and searchable?

The application search and indexing behavior is explained below and consists of the following components:

Enterprise Archive Index Store: Enterprise Archive indexes all the entries that a user can input. These entries are classified as keyword or text. Keyword entries are user inputs that should be searched as is, without analyzing them. Analyzing is the process of stripping (or transforming) user entries by tokenizing them, converting to lowercase, ASCII folded and so on. Text type fields are analyzed and any of the above actions are performed on them. Text fields refer to user content such as subject, body, and attachment text. All other user entries are keywords.
Querying the Enterprise Archive Index Store: Querying refers to searching for specific terms or phrases that are indexed. Anything that is not indexed cannot be searched in Enterprise Archive. Querying for special characters will not return any search results as the application analyzes text inputs and strips them of special characters.

How to search inside a X-header in the UI?

To search for specific X-header values in email conversations:

Navigate to Archive Management > Search.
Click and expand Words/Phrases.
Specify the X-header or the complete X-header name and value together in the Must have at least one of these keywords under the List of Terms section.
Enable the System check box in the Where to Search pane.

Exports

What is the resulting file type when I export archived social media communications as Native from Enterprise Archive?

When you export documents from the following media as Native, the resulting file type is as follows:

Original Media	Content Source	Resulting File Type
Facebook IM	Socialite	HTML
Facebook Posts	Socialite	HTML
LinkedIn	Socialite	HTML
Bloomberg IM	Vantage	HTML
Bloomberg IM	API	HTML
Bloomberg EML	Vantage	EML - If classification is set as Email during ingestion HTML - If classification is any other type other than Email.
Bloomberg EML	API	EML - If classification is set as Email during ingestion HTML - If classification is any other type other than Email.
Outlook	EGW	EML
MSG Files	ITM XML	If attribute under ITM XML is as follows: `MS-OXProps-Version =<"1" "2" or "3">` `MS-OXProps-Message ="<any value>"` Native Output: MSG Else, Native Output: EML
EML Files	ITM XML	EML

Why does data appear truncated in the exported .csv file?

In a CSV export, Bcc participants are only limited to 50.
When a document containing multiple participants (for example, more than 500 in the To/Cc/Bcc fields) is exported as CSV, the participants listing in the file gets truncated to multiple rows. This is Microsoft Excel issue. As a workaround, Smarsh recommends to use Notepad++ to view such CSV files.
Exported CSV files for Tier 2 data support the following fields:
- snapshot_time (Represents the sent time of the message as displayed in the UI)
- files_count
- archived_time
- subject
- users_to_cc_bcc
- users_bcc
- gcid
- channel
- messageId
- interaction_id
- network
- transcript_id
- users_from
- users_to
- users_cc
- key
- cms_collected_item_id
- cms_hold_flag

What is the file size limit of an export?

If the size of an exported package exceeds 1 GB , the package is divided into multiple chunks. This applies for both ZIP and PST files, and each chunk will not exceed 1 GB in size.
The UI displays the progress for long-running exports, particularly for exports larger than 4 GB. A notification message will indicate that a large export is in progress and may take longer to complete.

Scheduled Exports

Is there a Failover/Failback mitigation plan for Scheduled Exports?

In a typical Disaster Recovery (Failover(FO)/Failback(FB)) setup in Enterprise Archive, upon failover to the Secondary site, ongoing scheduled exports will not continue by default and remains in a Failed state. This is because there will be no DR event created and the Secondary site will not understand nor will have any information about the ongoing export and its progress.

As a mitigation plan, for the frequently triggered scheduled exports to continue or to not lose any data due to site swap (FO/FB), Smarsh SRE team will perform certain configurations manually on the scheduled exports template. This will enable the ongoing scheduled export to resume and also ensure that the delta of Failed documents are synced to the Secondary site.

Limitations

Here are a few limitations that are observed once the synchronization is complete:

When documents are filtered/searched with a specific “Date Range”, the search results will be different in the Primary and Secondary site. The results will not match as the processed/archive time of the documents on both the sites are different. That is, same documents that are a part of the Primary site for a specified date range cannot be expected in Secondary site. This also corresponds to any lag for items being archived in Primary site and within 72 hours on the Secondary site.
When there are no documents available for a specified date or date range in the DR but was available in Primary before the DR event, it implies that synchronization between Primary and Secondary wasn't completed. In such cases any exports for that date or date range will complete with zero documents.

The different in time is there between the Primary and Secondary site cannot be derived beforehand.

Administration

What are the variables available in Enterprise Archive?

The following variables in Enterprise Archive can are exposed to create your own Alert or Notifications templates.

Variables Used	Description
`${BRANDNAME}`	Brand name used for the product, which is Communications Intelligence Platform.
`${FIRSTNAME}`	First Name of the user to whom these alert emails are triggered.
`${LASTNAME}`	Second Name of the user to whom these alert emails are triggered.
`${URL}`	Link to a corresponding page for the user to click for more information.
`${EMAIL}`	Email address of the user.
`{PWDIMAGENAME}`	Image where users temporary password is displayed.
`${QUERYNAME}`	Name of a Supervision policy.
`{QUEUECREATOR}`	Name of the user who created the Supervision Policy.
`${ACTUALHITS}`	Number of policy hits. That is, the total number of matches in words or phrases according to the Supervision policy.
`${CIRCUITBREAKERLIMIT}`	The specified circuit breaker limit for Flag policies.
`${REPORTNAME}`	Name of the report selected while creating a scheduled export.
`${MODE_OF_SCHEDULE}`	The mode of schedule used for the report such as, Run Once, Run Frequently, or Run Now.
`${TIME_ZONE}`	The time zone set for the scheduled report.
`${DATE_RANGE}`	The date range specified for the scheduled report.
`${EXPORT_NAME}`	Name of the export defined while creating an export job.
`${STATUS}`	The status of the export job, such as Successful, Failed, or Disposed.
`${SCHEDULE_REPORT_NAME}`	Name of the report defined by the user while creating a scheduled export.
`${REPORT_COUNT}`	Number of reports failed to generate.
`${FAILURE_SUMMARY}`	Summary of failed reports.
`${QUEUENAME}`	Name of a Queue in a Supervision policy.
`{MIN_CHARS_BEFORE_WILDCARD}`	The minimum number of characters set for wild card search to initiate. That is, in Enterprise Archive, the character limit is three by default. Therefore a wildcard search will function only if there are a minimum of three characters before a wildcard.

How do I keep a track of user updating their passwords?

You can track all create, update, and delete operations performed in Enterprise Archive UI by looking viewing the User Activity Logs. Password updates and reset can also be tracked by enabling the Update check-box under the Actions section.

What cookies are used by Enterprise Archive?

The following cookies are used by Enterprise Archive:

Cookie Type	Name	Description
Persistence (1000 days)	saved-theme	Selected UI theme (Defaults to ' alcatraz ').
Persistence (1000 days)	tf	User preference - Time Format
Persistence (1000 days)	tz	User preference - Timezone
Session	uuid	Session ID
Persistence (10000 days)	validLogin	Boolean value to check valid login.
Session	xdcc	Double Cookie Check Token to avoid CSRF attack.

Archive Management

What are the file size limitations in Enterprise Archive?

See Enterprise Archive Conversation and File Size Limitations.

What is Retention Policy?

Retention policy deals with defining and managing retention periods for the archived data. Retention management allows an administrator to retain data in the archive, only for the required duration. Retention periods must be managed to ensure that data is stored in the archive only as long as it is necessary for the organization. Retention management also ensures that data is not removed prematurely by accident before the retention period expires.

What is Preservation Policy?

Post Enron crisis, Federal Rules of Civil Procedures (FRCP) amendments required organizations to preserve Electronically Stored Information (ESI) of employees when there is a reasonable anticipation of litigation between communicating parties. ESI can be preserved at both document and employee (or custodian) level.

Preservation Policy allows organizational level hold to be placed on certain custodians whose content needs to be protected at all times with or without being associated to cases. Preservation Policies also provides an efficient mechanism to ensure content is protected quickly without competing with disposition when millions of documents need to be put on hold. The Preservation Policies do not completely eliminate the risk from losing documents to disposition process but significantly reduce the risks. The document level preservation in Enterprise Archive is Legal Hold.

What happens when a Preservation/Custodian hold is present, but Retention policy expires on a document?

Even though the configured retention policy expires, the document will not be deleted or disposed since the participant is on prevervation hold.

Example

Document Status

Disposition Status after 30 days

Start Time: Jan 01, 2022

Retention Policy: 30 Days

Preservation/Custodian hold Status: Applied

DOCUMENT NOT DISPOSED

When disposition triggers on 1st Feb 2022, the document will not be deleted or disposed since participant is on preservation hold.

What happens when Preservation/Custodian hold is removed, and Retention policy expires?

The document will be deleted or disposed.

Example

Document Status

Disposition Status after 30 days

Start Time: Jan 01, 2022

Retention Policy: 30 Days

Preservation/Custodian hold Status: Removed

DOCUMENT DISPOSED

When disposition triggers on 1st Feb 2022, the document will be deleted or disposed.

Case Management

How does Retention work for a Case with two participants that have different retention values?

If a message in a Case has two participants that have different retention values set, what is the expected behavior?

Use Case

Case1 has a message with two participants (Brandon and Jeremy). Brandon's participant has a retention period set of 3 days. Jeremy's is 1 day. There are two reviewers on the Case - Reviewer1 and Reviewer2.

Questions

If the message has both participants, Brandon and Jeremy, after day 1, is the message purged?
If Reviewer1 (reviewer on the case) is restricted by his role to see only Jeremy (restriction policy), would Reviewer1 still see the message in the case after day one? If the retention period expires for Jeremy, is that message still returned when Reviewer1 searches.

Answer

The message would not be purged because Brandon's participant has a retention of 3 days. The message goes by lengthiest retention period.
Yes, Reviewer1 will be able to see the messages when searched.

A message is given various Retention Policies over its life cycle in the archive. In this case, it would have a 3-day retention period for Brandon, and a separate 1-day retention period for Jeremy.

If it was hit by supervision it may have a Supervision Queue driven retention period. Purge works by looking nightly for documents that no longer have ANY valid retention periods. Thus, as long as there is one valid retention period the message will not be purged. If the message is not purged, the message is still surfaced in all the different applications where it has been presented throughout the application. Therefore, the retention period in Supervision may expire, but there is a hold, or separate retention period that maintains the record. That record remains available in Supervision until all policies expire. Participant targeted retention periods have no bearing on the searches performed. The document is available and can be search by any of its attributes.

How is a case specific tag group different from a tag group?

Private tags groups, also known as case specific tag groups are visible only within a particular case. Private tags are confidential tags such as sexual harassment, racism, or employee termination and so on. Such tags must not be exposed to other case reviewers and hence private tags are visible only within a particular case.

Note

Private tags are not displayed in the reports as well.

For more information on how to create case specific tag group, see Creating Tag Groups and Adding Tags.

Supervision

What is Echo cancellation and how does it work?

See Echo-Cancellation for Conduct

Should Sampling Profile be used to filter participants/groups?

No, it is not mandatory for the user to use sampling profiles to filter participants/groups. Random selection will be employed if no sampling or policies are applied. For more information on Sampling Profile, see Configuring Sampling Profiles.

What is a circuit breaker and how does it function?

A circuit breaker is a limit specified in the Supervision application while configuring a Queue. The Supervision policy executing that queue is disabled beyond the configured limit. The following behavior is observed in a Supervision Flag and Ignore policy:

Circuit Breakers are designed to limit the maximum number of documents that can be flagged/ignored by a policy for a given queue. The aim of circuit breakers is to prevent policies from unnecessarily flooding review queues, with a large number of documents. This in turn will protect the SLAs defined for Queue completion in the system.

Flag Policies: The circuit breaker limit for Flag policies is specified in Enterprise Conduct. Enterprise Conduct disables a policy when it enters a circuit breaker situation. A circuit breaker situation arises when the number of documents that a policy can fetch exceeds the maximum number of documents defined in the Disable policy when reaching a max of <text box> hits parameter. In this scenario, no documents are fetched by the policy, the policy gets disabled, and the status of the policy changes to Disabled by system, and an email notification is sent to the queue creator. For example, the criteria of a policy is such that it is able to fetch 5000 documents, but the parameter Disable policy when reaching a max of <text box> hits is set to 2000. In such a case, when the policy runs and exceeds the limit of 2000 documents, it enters a circuit breaker situation, and gets disabled.

Ignore Policies: The circuit breaker limit for Ignore policies is set to 500000 by default. This is controlled by a server property setting. If the documents being returned exceed the circuit breaker limit for Ignore policies, Enterprise Conduct will disable the Queue and issue an email notification to the user regarding the same.

Note

When a conversation comes into a queue, and you delete the policy that had fetched that conversation, then the conversation continues to stay in the queue. This conversation is removed from the queue after you have completed the Conduct process by closing the conversation.
If a conversation is fetched by multiple policies, then the conversation is displayed in the queue only once. In this case, the Matched policy panel displays all the policies that fetched the conversation, and all policy types are displayed.

Also see

Conduct Policy FAQs